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Preface 


This is the second edition of a text that is intended for a one-semester course in algebraic 
number theory for senior undergraduate and beginning graduate students. The Table of 
Contents on pages vii—viii is essentially self-descriptive of each chapter’s contents, requir- 
ing no need for repetition here. What differs from the first edition deserves elucidation. 
Comments from numerous instructors and students over more than a decade since the first 
edition appeared have given way to a new style, methodology, and presentation. 


The focus has changed from the first edition approach of introducing algebraic numbers 
and number fields in the first two chapters and leaving ideals until Chapter 3, to the second 
edition strategy of looking at integral domains, ideals and unique factorization in Chapter 
1 and field extensions including Galois theory in Chapter 2. This changes the first edition 
method of having the entirety of Galois theory relegated to an appendix and bringing it, 
in this edition, to the main text in a more complete, comprehensive, and involved fashion. 
Chapter 3 in this edition is devoted to the study of class groups, and as a new feature, not 
touched in the first edition, we include the study of binary quadratic forms and comparison 
of the ideal and form class groups, which leads into the general ideal class group discussion 
and paves the way for the geometry of numbers and Dirichlet’s Unit Theorem. In the first 
edition, this was done in Chapter 2 along with applications to the number field sieve. In this 
edition, the applications are put into a separate Chapter 4 including the number field sieve in 
84.5, introduced via §4.4 on factoring, including Pollard’s cubic factoring algorithm, which is 
more comprehensive than that of the first edition. In turn, §4.1-§4.3 are applications leading 
to the latter that involve solutions of Diophantine equations including Bachet, Fermat, and 
prime power representation. This includes Kummer’s proof of Fermat’s Last Theorem (FLT) 
for regular primes, Case I, which was put into Chapter 3 in the first edition. This edition 
maintains the inclusion of Bernoulli numbers, the Riemann zeta function, and connections 
via von Staudt—Clausen to the infinitude of irregular primes. Applications also appear at 
the end of Chapter 5 with an overview of primality testing and, as an application of the 
Kronecker—Weber Theorem, Lenstra’s primality test employing the Artin symbol. A special 
case of this test is presented as the Lucas-Lehmer test for Mersenne primes. 


Chapter 5 replaces Chapter 4 of the first edition in its discussion of ideal decomposition in 
number fields but spreads out the number of sections to more evenly present the material. 
One feature of the second edition that distinguishes it from the first is that there is much 
less emphasis on using exercises with the framework of proofs in the main text. Exercises 
are referenced in the proofs only when they represent material that is routine and more 
appropriate for a student to do. Throughout the text, this is one of the major changes. In 
particular, in the proof of the Kronecker-Weber Theorem, as well as in the proofs of the 
reciprocity laws in Chapter 6, what were exercises in the first edition are now explained in 
full in the main text. Moreover, exercises in this edition are designed to test and challenge 
the reader, as well as illustrate concepts both within the main text as well as extend those 
ideas. For instance, in the exercises for §2.1, Galois theory is expanded from the number 
field case to finite fields and general fields of characteristic zero which is then invoked in 
§5.4 to discuss residue class fields and connections with the Frobenius automorphism. Thus, 
the reader is led at a measured pace through the material to a clear understanding of the 
pinnacles of algebraic number theory. What is not included from the first edition is any 
separate discussion of elliptic curves. This is done to make the text more self-contained 
as a one-semester course for which the addition of the latter is better placed in a related 
course such as given in [54]. Also, the numbering system has changed from the first edition 
consecutive numbering of all objects to the standard method in this edition. 
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x Algebraic Number Theory 


@ Features of This Text 


e The book is ideal for the student since it is exercise-rich with over 310 problems. The 
more challenging exercises are marked with the symbol v«. Also, complete and detailed 
solutions to all of the odd-numbered exercises are given in the back of the text. Throughout 
the text, the reader is encouraged to solve exercises related to the topics at hand. Complete 
and detailed solutions of the even-numbered exercises are included in a Solutions Manual, 
which is available from the publisher for the qualified instructor. 


e The text is accessible to anyone, from the senior undergraduate to the research scientist. 
The main prerequisites are the basics of a first course in abstract algebra, the fundamentals 
of an introductory course in elementary number theory, and some knowledge of basic matrix 
theory. In any case, the appendices, as described below, contain a review of all of the 
requisite background material. Essentially, the mature student, with a knowledge of algebra, 
can work through the book without any serious impediment or need to consult another text. 


e There are more than forty mini-biographies of those who helped develop algebraic number 
theory from its inception. These are given, unlike the footnote approach of the first edition, 
in boxed highlighted text throughout, to give a human face to the mathematics being 
presented, and set so they do not interfere with the flow of the discourse. Thus, the reader 
has immediate information at will, or may treat them as digressions, and access them later 
without significantly interfering with the main mathematical text at hand. Our appreciation 
of mathematics is deepened by a knowledge of the lives of these individuals. I have avoided 
the current convention of gathering notes at the end of each chapter, since the immediacy 
of information is more important. 


e There are applications via factoring, primality testing, and solving Diophantine equations 
as described above. In $4.5, we also discuss the applications to cryptography. 


e The appendices are given, for the convenience of the reader, to make the text self- 
contained. Appendix A is a meant as a convenient fingertip reference for abstract algebra 
with an overview of all the concepts used in the main text. Appendix B is an overview 
of sequences and series, including all that is required to develop the concepts. Appendix 
C consists of the Greek alphabet with English transliteration. Students and research math- 
ematicians alike have need of the latter in symbolic presentations of mathematical ideas. 
Thus, it is valuable to have a table of the symbols, and their English equivalents readily 
at hand. Appendix D has a table of numerous Latin phrases and their English equivalents, 
again important since many Latin phrases are used in mathematics, and historically much 
mathematics was written in Latin such as Bachet’s Latin translation of Diophantus’ Greek 
book Arithmetica. 


e The list of symbols is designed so that the reader may determine, at a glance, on which 
page the first defining occurrence of a desired notation exists. 


e The index has over two thousand entries, and has been devised in such a way to ensure 
that there is maximum ease in getting information from the text. There is maximum cross- 
referencing to ensure that the reader will find ease-of-use in extracting information to be 
paramount. 


e The bibliography has over seventy entries for the reader to explore concepts not covered in 
the text or to expand knowledge of those covered. This includes a page reference for each 
and every citing of a given item, so that no guesswork is involved as to where the reference 
is used. 

e The Web page cited in the penultimate line will contain a file for comments, and any 
typos/errors that are found. Furthermore, comments via the e-mail address on the bottom 
line are also welcome. 
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Suggested Course Outlines 


A glance at the Table of Contents will reveal that there is a wealth of material beyond 
a basic course in algebraic number theory. This section is intended for the instructor, by 
giving several routes from a course in the basics of algebraic number theory to a more 
advanced course with numerous applications, as well as other aspects such as Kummer’s 
proof of FLT for regular primes, and advanced reciprocity laws. 


Chapters 1 through 3 are essential as a foundation, whereas Chapter 4 is optional, and the 
instructor may skip it or add any section as an application of the material in the previous 
chapters. §4.4-§4.5 go together as advanced material on factoring, with §4.4 preparatory 
material using Pollard’s algorithm to set the stage for the description of the number field 
sieve in 84.5. 

In §5.1-85.4, the groundwork is laid for ramification theory. However, in §5.5, the theory of 
Kummer extensions and applications to Kummer’s proof of FLT for regular primes in the 
second case may be eliminated from a basic course in algebraic number theory. $5.6 on the 
proof of the Kronecker—Weber theorem, is a significant application of what has gone before, 
but is again not necessary for a basic course. §5.7 is an applications section on primality 
testing. 

In a bare-bones course, one does not need to proceed into Chapter 6. However, the chap- 
ter illustrates some of the pinnacles of algebraic number theory with proofs of the cubic, 
biquadratic, and Eisenstein reciprocity laws, as well as development of the Stickelberger re- 
lation. In a more advanced course, these topics should be included. The following diagram 
is a schematic flow-chart to illustrate the possible routes for the course, from the most basic 
course to one filled with applications. 
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Chapter 1 


Integral Domains, Ideals, and 
Unique Factorization 


Take care of your body with steadfast fidelity. The soul must see through these eyes 
alone, and if they are dim, the whole world is clouded. 

Johann Wolfgang von Goethe (1749-1832), German poet, novelist, and 
dramatist 


In this chapter, we introduce integral domains, and develop the concepts of divisibility, 
irreducibility, and primes which we apply to Dedekind domains. This preamble allows us to 
develop Noetherian, principal ideal, and unique factorization domains later in the chapter 
thereby setting the foundation for the introduction of algebraic number rings and number 
fields. The reader should be familiar with some basic abstract algebra, such as groups, rings, 
and fields and their properties, which are reviewed in Appendix A, starting on page 319, 
for convenience and finger-tip reference. 


1.1 Integral Domains 


In order to define concepts in the sequel, we will need the following. 
Definition 1.1 — Units 


An element a in a commutative ring R with identity 1p is called a unit in R when there is 
a 8B € Rsuch that a8 =1R. The multiplicative group of units in R is denoted by Up—see 
Exercise 1.7 on page 6. 
Example 1.1 In Z[\/2] = R, 1+ V2 is a unit, since 

(1+ V2)(-14+ V2) =1g=1. 


For the following, recall that a zero divisor in a commutative ring R is a nonzero element 


a € Rsuch that a8 = 0 where 6 4 0. 


Definition 1.2 — Integral Domains 


An integral domain is a commutative ring D with identity 1p, having no zero divisors. In 
particular, if every nonzero element is a unit, then D is a field. 


1 


2 1. Integral Domains, Ideals, and Unique Factorization 


Application 1.1 — The Cancellation Law 


One of the most important properties of an integral domain D is that the cancellation law 
holds, namely if a,8 € D with a nonzero and af = ay, then 6 = 4. 


Example 1.2 The ordinary or rational integers 
Z = {...,—2,—1,0,1,2,...} 


provide us with our first example of an integral domain. 


Example 1.3 For any nonsquare integer n, 
Z[Vn] = {a+bVD : a,b € Z} 


is an example of an integral domain. For example, if nm = —1, we have the Gaussian 
integers. Indeed, n = —1 yields /—1 = 7 which is an example of a special kind of unit, the 
generalization of which we now define. 


Definition 1.3 — Primitive Roots of Unity 


For m € N = {1,2,3,...} the natural numbers Gy, denotes a primitive m™ root of unity, 
which is a root of x™ — 1, but not a root of x? — 1 for any natural number d < m. 


th 


Example 1.4 With reference to Example 1.3, where n = —1, ~—1 =i = ¢ is a primitive 
fourth root of unity, since it is a root of e+ — 1, but not root of #7 — 1 for 7 = 1,2,3. Also, 


¢3 = (-14+ v-3)/2 
is a primitive cube root of unity, since it is a root of x? — 1, but clearly not a root of x? —1 


or x—1. 


Example 1.5 Suppose that p is a prime and ¢, is a primitive p-th root of unity. If we set 


p-1 
z= 91S 
j=0 
then 
p-l p—l1 
wp = DIG = 0G =a. (1.1) 
7=0 7=0 


Thus, if « £ 0, dividing through (1.1) by x gives ¢, = 1, a contradiction. Thus, 
1+Q4+G+---+07* =0. 


This fact will prove useful when discussing notions surrounding roots of unity later in 
the text—see Exercise 2.25 on page 69, for instance. Also, we generalize this example in 
Exercise 6.28 on page 310. 


1.1. Integral Domains 3 


Example 1.3 is a motivator for the more general concept, which later turns out to be the 
so-called “ring of integers of a quadratic field”——see Theorem 1.28 on page 45. 


Application 1.2 — Quadratic Domains and Norms 


If n is a nonsquare integer, then Z[,/n] is an integral domain as given in Example 1.3, where 
we note that Z[,/n] is a subset of the field Q(./n). We call domains in Q(./n) quadratic 
domains. There is a slightly larger subset of Q(./n) that is also an integral domain when 
n = 1(mod 4)—see Exercise 1.1 on page 6 


Z |; ae C QA Vn). 


Now we may combine Example 1.3 with this application to describe some special quadratic 
domains as follows. Define 


Z[wn] = {a+ dwn : a,b € Z}, 


where 


Jn if n #1 (mod 4). 


Then Z[w,,] is a quadratic domain. 


- -| (1+ ./n)/2 ifn =1(mod 4), 


Another concept we will see in greater generality later, but applied here to quadratic do- 
mains, is the quadratic norm N : Q(./n) + Q via 


N(a+t bn) = (a+ b/n)(a — b/n) = a? — nb? €Q. 


In particular, by Exercise 1.3 


a € Uz, if and only if N(a) = +1. 


We will be using the concept of a norm throughout our discussion to establish properties 
of, in this case, quadratic domains, or in general, rings of integers, that we have yet to 
define—see Definition 1.30 on page 36. 


The notion of divisibility of elements in an integral domain is a fundamental starting point 
for understanding how algebraic number theory generalizes the notions of “divisibility,” 
“primality,” and related concepts from the integers Z to other integral domains such as 
Z[wn]- 


Definition 1.4 — Divisors and Trivial Factorizations 


Ifa,@ € D an integral domain, then a is said to be a divisor of 8, if there exists an element 
y € D such that 6 = ay, denoted by a | 6. If a does not divide 6, then we denote this by 
at 8. If 6 = ay, where either a € Up or y € Up, then this is called a trivial factorization 


of £. 


Example 1.6 Consider the notion of units given in Definition 1.1 on page 1 and the 
illustration given in Example 1.1. Then we have that both (1+ V2) | 1 and (-1+ v2) | 1. 
Indeed, this may be said to characterize units in D, namely 


a isa unit in an integral domain D if and only if a | 1. 


This may be used as an alternative to that of Definition 1.1. The following notion allows 
for the introduction of a different approach. 


4 1. Integral Domains, Ideals, and Unique Factorization 


Definition 1.5 — Associates 


If D is an integral domain and a,8 € D witha | B and 6 | a, then a and 6 are said to be 
associates, and we denote this by a ~ (@. If a and £ are not associates, we denote this by 


a B. 


Example 1.7 From Definition 1.5 and Example 1.6, we see that a is a unit in an integral 
domain D if and only ifa~ 1. Furthermore, if a ~ 8 for any a,@ € D, then there is a unit 
u € D such that a = u@. To see this, since a | 8, then there is a y € D such that 8 = ya. 
Conversely since ( | a, there is a 6 € D such that a = 66. Hence, a = 68 = bya, so by the 
cancellation law exhibited in Application 1.1 on page 2, 1 = dy, so 6 = y~! = wis a unit 
and a = uf. 


Example 1.8 In Z[V10], 2+ /10 ~ 16 + 5v/10 since 

16 + 5V10 = (2+ V10)(3 + V'10), 
so (2+ V/10) | (16 + 5/10), and 

2+ V10 = (16 + 5V10)(—3 + V'10) 
so (16 + 510) | (2+ V10). 


Example 1.9 Since 


6 = (44 V10)(4— V10), 
then (4+ V/10) | 6 in Z[V10}. 


Notice that 6 = 2-3 so it appears that 6 does not have a “uniqueness of factorization” 
in Z[V10] in some sense that we now must make clear and rigorous. Now we develop the 
notions to describe this phenomenon which is distinct from Z where 6 does have unique 
factorization via the Fundamental Theorem of Arithmetic. In fact, in Z, a prime, is defined 
to be an integer p such that the only divisors are +1 and +p. Thus, primes satisfy that 


if p | ab, then either p | aor p | b (1.2) 


—see [53, Lemma 1.2, p. 32]. Also, primes in Z satisfy that 


if p = ab, then a = +1 or b= 41. (1.3) 


The following generalizes property (1.3) to arbitrary integral domains. Then we will discuss 
property (1.2) and show how (1.2)-(1.3) generalize to similar notions in general integral 
domains. 


Definition 1.6 — Irreducibles 


If D is an integral domain and a nonzero, nonunit element 6 € D satisfies the property that 
whenever { = ay, then either a € Up or y € Up, then £ is said to be irreducible. In other 
words, the irreducible elements of D are the nonzero, nonunit elements having only trivial 
factorizations. If a nonzero, nonunit element of D is not irreducible, it is called a reducible 
element. 


1.1. Integral Domains 5) 


Example 1.10 Any prime p € Z is irreducible, since its only factorizations are p = 
(+1)(+p). Conversely, if n € Z is irreducible, then the only factorizations are trivial so n 
is prime in Z. In other words, in Z, p is prime if and only if it is irreducible. This fails to 
be the case in arbitrary integral domains and this provides the fodder for algebraic number 
theory. 


Example 1.11 Consider 
D=Z([Vv10| and 6 = 4+ V10. 


If 8 is not irreducible, then 6 = ay, where neither a nor 7 is a unit in Z[V10]. Since 


by Exercise 1.2 on the next page, then without loss of generality 
N(a) = N(a+ bv 10) =3, 
where a =a+by/10. Thus, a? — 10b? = 3 so the Legendre symbol equality holds: 


-()-3)-@)-0 


a contradiction, so 4+ v 10 is irreducible. Similarly, its conjugate 4— V10 is irreducible. 


Via Example 1.9, we have 4+ 10 divides 6 but by Exercise 1.4, 4+ V10 divides neither 2 
nor 3. This motivates the next concept, generalizing (1.2). 


Definition 1.7 — Primes 


If 6 is a nonzero, nonunit in an integral domain D, then ( is called a prime if whenever 
B | ay, then either 6 | aor B | 7. 


Example 1.12 From Example 1.11 we see that 4+ V/10 are not primes in Z[V10]. Now 
we show that 2,3 are not primes in Z[V10] (although they are primes in Z). From Example 
1.9, 2 and 3 both divide (4+ /10)(4 — 10). However, by Exercise 1.4 on the following 
page, neither of them divides 4+ /10, so neither is prime. Yet by Exercise 1.4 both are 
irreducible. This illustrates the departure, in general integral domains, from the case in Z, 
where all irreducibles are prime as shown in Example 1.10. Yet, the following shows us that 
primes are always irreducible. 


Theorem 1.1 — Primes Are Irreducible 


If D is an integral domain and ( € D is prime, then @ is irreducible. 


Proof. Let 8 € D be prime and suppose that 6 = ay. Then a fortiori, 6 | ay so B | a 
or B | y. Without loss of generality, assume that 8 | a. Then there is a 6 € D such that 
a = 86. It follows that 8 = ay = Gdy, so by Application 1.1 on page 2, 1 = dy, which 
makes y a unit in D. Hence, § is irreducible. 


Remark 1.1 We have seen that the converse of Theorem 1.1 does not hold. Now our 
task is to determine those integral domains for which it does hold. This will involve making 
precise the notion of “unique factorization” of elements in general integral domains. We 
begin this delineation in §1.2. 
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Exercises 


1.1. 


1.2. 
1.3. 
1.4. 


1.5. 


Let n be a nonsquare integer. Prove that if m = 1(mod 4), then the subring 
Z|wn] S Qvn) 


given in Application 1.2 on page 3, is an integral domain. Conclude that Z[,/n], for 
any nonsquare n, is an integral domain by similar reasoning. 


Prove that norms in quadratic domains are multiplicative, i.e. N(ay) = N(a)N(q). 


Prove that an element a in Z[w,] is a unit if and only if N(a) = +1. 


Prove that in a quadratic domain D, if a | GB in Zlwp], then N(a) | N(G) in Z. 
Conclude that 4+ V10 are not associates of either 2 or 3 in Z[V10]. Also, conclude 
that 2 and 8 are irreducible in Z[V10]. 


Let D = Z[w,] be a quadratic domain and let a € D satisfy the property that 
|N(a)| = p, a prime in Z. Prove that a is irreducible in D. Provide either a proof or 
a counterexample to the converse: If a € D is an irreducible element, then |N(a)| is 
a prime in Z. 


. Prove that 2 is irreducible but not prime in Z[V/—5]. 
. Prove that the units of an integral domain form a multiplicative abelian group. 


. Prove that the relation ~ given in Definition 1.5 on page 4, is an equivalence relation, 


namely that it is reflexive: a ~~ a, symmetric: a ~ b implies b ~ a, and transitive: if 
a~ band b~c, then a~ c for all a,b,c € D. 


. Prove that in an integral domain D an element a is irreducible if and only if every 


divisor of q@ is either an associate of a or a unit. 


. If D is a quadratic domain show that if a,8 € D with a ~ £6, then |N(a)| = |N(6)|. 


. Is the converse of Exercise 1.10 true? If so prove it, and if not, provide a counterex- 


ample. 


. Find an a € Z[V15] such that a = ajazg = $162 where a,,§; are irreducible for 


j =1,2 but neither of a1, a2 is an associate of 8; for 7 = 1,2. 


. Apply the question in Exercise 1.12 to Z[V30]. 
. Show that 1+ 7 =1-+ /—1 is prime in the Gaussian integers Z|?]. 


. Find all units in the Gaussian integers Z|i]. 


. Prove that +(1+/2)”" € Uy) ya for all n € Z. Prove that there are no other units in 


Z[V2]. In other words, show that 
Up V3 = {+(1 + J2)" Ine Z}. 


. If D is an integral domain and a,@ € D, not both zero, then y € D is called a greatest 


common divisor (gcd) of a and @ if the following two conditions are satisfied. 
(a) y| a, and | 8. 
(b) Ifo | a, and o | 8 for some o € D, then a | 7. 


Prove that any two gcds must be associates. Also, provide an example of a ring in 
which elements exist that have no greatest common divisor. 
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1.2 Factorization Domains 


Not everything that can be counted counts, and not everything that counts can be 
counted. (Attributed) 
Albert Einstein (1879-1955), German-born theoretical physicist 


In this section we explore and solidify the notions of unique factorization in certain integral 
domains and the intimate connection with the core features of algebraic number theory 
which this engenders. 


Definition 1.8 — Factorization Domains 


If D is an integral domain in which every nonzero, nonunit can be represented as a finite 
product of irreducible elements of D, then D is called a factorization domain. A factorization 
domain in which any nonzero, nonunit can be expressed as a product of irreducibles that 
is unique up to units and the order of the factors is called a unique factorization domain 
(UFD). 


Remark 1.2 Definition 1.8 says that D is a unique factorization domain when the follow- 
ing occurs. Suppose that a € D is arbitrarily chosen with 


alo stag pia 
where u € Disa unit, n,a; € N and 4; is irreducible for 7 = 1,2,...,n. Then D is a unique 


factorization domain if any other representation: 


a= ona abe 
where v € Dis aunit, m,b; € N, and «; is irreducible for 7 = 1,2,...,m, implies that m = n 
and after possibly rearranging the y;, we have y; = 4;, and a; = b; for 7 =1,2,...,n. 


Now we look at a criterion for a factorization domain to be a unique factorization domain in 
terms of the concepts we studied in §1.1. This will be the defining feature of such domains 
in terms of these concepts. 


Theorem 1.2 — Unique Factorization—Irreducibles Are Prime 


If D is a factorization domain, then D is a unique factorization domain if and only if every 
irreducible element of D is prime. 


Proof. Assume that all such factorizations are unique. If a € D is irreducible, we must 
show that a is prime. If a | 76, there exists aa € D such that y8 = ao. Each of 6,7,0 
has unique factorization, so write 


r s t 
6=u]] 4, y=v| [a o=w][o, 
j=l j=l j=l 


where u,v,w € Up, and each §;,7;,0; is irreducible. Thus, 


t s r 
aw |] o; =ao =78 = vu] [] 4; 
j=l =i <JF=h 
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Since a is irreducible, then by unique factorization, a is an associate of one of the 8; or y;. 
In other words, a | 6, ora | 7, SO @ is prime. 


Conversely, assume that every irreducible in D is prime. Suppose that 
UQ1**+ Ay = 0f,-+: 8, forr >s>1, and u,v € Up,, (1.4) 


with a;, 8; irreducible. We must show that r = s, and that each a; is an associate of some 
Be. We use induction on r. If r = 1, then s = 1, so we are done. Assume that unique 
factorization holds for all factorizations of length at most r—1 > 1. Since 6, | UAL +++ Ap, 
then {5 | a; for some i € {1,2,...,r}, since 6, is not a unit. Thus, 8, is an associate of a;. 
Renumber the a; so that a; = a,. Thus, by Application 1.1 on page 2, we may cancel the 
a, = Bsw (where w is a unit) from each side of Equation (1.4) to get 


UA +++ Mp1 = W VBL +++ Bs—1- 


By the induction hypothesis, r — 1 = s — 1, and the a; are associates of the 8;. The result 
now follows by induction. 


Remark 1.3 Theorem 1.2 provides the key to understanding unique factorization in in- 
tegral domains, namely the failure of unique factorization is the failure of (some) set of 
irreducibles to be prime. 


In Exercise 1.17 on page 6 we defined greatest common divisors in integral domains, but 
there, only sought to find domains without gcds. Now we will look at an example of an inte- 
gral domain where such divisors always exist. This provides a motivator for a more general 
class of domains where there is a “norm” similar to that we found in quadratic domains 
introduced in Application 1.2 on page 3. These domains are important in our understanding 
of the basics. First we need to establish a division algorithm. !!We specialize to Gaussian 
integers as a motivator for what follows. Recall from the definition in Application 1.2 on 
page 3 that the norm N is defined for any quadratic domain. 


Theorem 1.3 — Division Algorithm for Gaussian Integers 
Let a,6 € Zt] with 8 40. Then there exists 0,6 € Z[i] such that 


a= Bo +6, 
where 0 < N(é) < N(8). 
Proof. Let a/8 =e+di € C. Set 
f =|e+1/2| = Ne(c), and g = |d+1/2| = Ne(d), 


where Ne(z) is the nearest integer function. Here |y| is the floor function or greatest integer 
function—see [53, §2.5]. Hence, there are k,é € R such that 


|k| < 1/2, and |é| < 1/2 (1.5) 


11The term algorithm is derived from the Persian mathematician Mohammed ibn Musa al-Khowarizmi 
Mohammed, son of Moses of Kharezm, now Khiva (circa 790-850 A.D.). His book Algorithmi de Numero 
Indorum, the Latin translation of the no longer extant original Arabic text, was highly influential in bringing 
the Hindu-Arabic number system to Europe. Shortly after the appearance of these Latin translations, 
readers began contracting his name to algorism, and ultimately algorithm, which we use today to mean any 
methodology following a set of rules to achieve a goal. 
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with 
ct+tdi=(f+k)+(g4+ £1. (1.6) 


Set 
o0=f+t+giandd=a- {8o. (1.7) 


Then it remains to show 
0< N(6) < N(8). 


We know that N(6) > 0, since the norm is just a sum of two squares. Now we show that 
N(6) < N(£). 
By Exercise 1.2 on page 6 (the multiplicativity of the norm), we have 
N(5) = N(a— Bo) = N((a/8 — 0) 8) 
= N(a/6 — 0)N() = N(c+ di —0)N(8). 
However, from (1.6)—(1.7), we get 
c+di-o=c+4+di-(ft+gi)=(ce-f)+(d-gi=k+h. 
Therefore, by (1.5), 


N(65) = N(k + £i)N(B) = 
(k? + @)N(B) < (1/2) + (1/2)*)N(B) < N(B)/2 < N(B), 


as required. 


Remark 1.4 Theo in Theorem 1.3 is called a quotient and the 6 is called a remainder 
of the division. This follows the notions for the division algorithm in Z. 


Remark 1.5 Although Theorem 1.3 gives us a criterion for the existence of an algorithm 
for division in Z[?], there is no uniqueness attached to it. In other words, we may have 
many such representations as the following illustration demonstrates. 


Example 1.13 Let a = 10+i% and 8 = 2+ 5i, then we may find 0,6 € Z[i] using the 
techniques established in the proof of Theorem 1.3. We have 
a 10+2 (10+%)(2—5i) 25 48. 


c+di= 


B 2+5i (Q+5i)(2—-5) 29 29” 


1 25 A 1 48 1 
p=|e+5|=|Rtg| —tande=|a+5] =| 3 t3|7 2. 


Therefore, 9 = 1— 2i and 6 = a— Bo = 10+%—(2+52)(1— 2%) = —2. Moreover, we verify 


sO 


N(6) = N(—2) =4 < N(8) = N(2 + 5i) = 29 
with 
a=10+7= (24 52)(1 — 27) -2 = Bo + 6. (1.8) 


However, these choices are not unique since we need not follow the techniques of Theorem 
1.3. For instance, if we choose o = 1 —7 and 6 = 3 — 2i, then 


a=10+i=(245i)(1—i)+3-2 = Bo + 6, (1.9) 


where N(b) = 13 < 29 = N(2+5i) = N(S). Thus, by (1.8)-(1.9), we see that, when 
employing the division algorithm for Gaussian integers, the quotient and remainder are not 
unique. 
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Now we look at an integral domain where the existence of gcds is guaranteed, namely the 
Gaussian integers. 


Theorem 1.4 — Gaussian GCDs Always Exist 


If a,8 € Zt] = D, where at least one of a or f is not zero, then there exists a gcd y € Z[i] 
of a and 8B. 


Proof. Given fixed a,3 € Zi], not both zero, set 
8 ={N(ca+t pf) >0:a,p € Ziil}, 
with 8 £ @ since 
N(a) = N(1-a+0- 8), and N(B) = N(0-a+1- 8) (1.10) 


at least one of which is not zero and nonnegative, then at least one of them is in 8. Thus, we 
may employ the well-ordering principle—see page 340—to get the existence of an element 
Yo = 09a + PoP, for which its norm is the least value in 8, namely 


N(yo) < N(oa + pf) for all o,p € Zi). 
Claim 1.1 po is a greatest common divisor of a and /. 


Let 7 € Zi] with + | qa and T | 8. Then there exists 01,62 € Z[i] such that @ = 7d, and 
8B =762. Hence, 


Yo = 90 + poB = o0T61 + poTb2 = T(9051 + pod2), (1.11) 


SO T | yo. It remains to show that yo divides both a and £. 


Let 
k= Aya4+ AG (1.12) 


be such that N(«) € 8. Thus, by Theorem 1.3 on page 8, there exist u,v € Zli] such that 
ey ee (1.13) 


with 
0< N(v) < N(%). (1.14) 


Also, by (1.12)—(1.13), 
VY =K— oH = A1a+ A2B — (Goa + poB)M = (Ai — Gop)a t+ (A2 — por)8, 


so if vy £0, then N(v) € &. However, by (1.14), this contradicts the minimality of N(yo) in 
&, so vy = 0. We have shown that yo divides every element whose norm is in §. In particular, 
by (1.10)—(1.11), it divides a@ and 6, which secures claim 1.1. Hence, we have the result. 


Now we may look at the promised extension of the idea of a norm from Gaussian integers 
to a distinguished class of integral domains, which have more general functions describing 
them. 


Definition 1.9 — Euclidean Domains and Functions 


If D is an integral domain, then a mapping ¢: D+ Z is called a Euclidean function if it 
satisfies the two conditions: 
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(a) If a € D, ¢(a8) > (a) for all nonzero 6 € D. 
(b) Ifa,6 € D with 6 # 0, there exist 7,6 € D such that a = 78+ 6 and 4(6) < ¢(). 


When D possesses a Euclidean function then D is called a Euclidean domain. 


Example 1.14 In Z, ¢(z) = |z|, the usual absolute value, is a Euclidean function. Hence, 
Z is a Euclidean domain. 


Remark 1.6 In Definition 1.9 part (b), we cannot guarantee the uniqueness of the ele- 
ments 7,0. However, there are some distinguished domains for which they are unique. 


Example 1.15 If F is a field and D = F[z] is the polynomial ring in the indeterminate z, 
then 


o(f(@)) = deg(f(2)), 


the degree of f(x) € D is a Euclidean function on D. Note that if f(a) = 0, the zero 
polynomial, then 


deg(f(z)) = 1 


by convention. In this case, the values in part (b) of Definition 1.9 are unique—see [61]. 


We now examine integral domains having Euclidean functions for which the converse of 
Theorem 1.1 on page 5 holds, since this is a door leading into domains with unique factor- 
izations via Theorem 1.3 on page 8. First we need the following notion. 


Definition 1.10 — Field of Quotients 


If D is an integral domain, then the field F' consisting of all elements of the form a3~! for 
a,o € D with 6 £0 is called the field of quotients or simply the quotient field of D. 


Remark 1.7 There is, in actuality, an isomorphic copy of D in F, but in practice it is 
standard to assume that D is identified with this copy. In the case of a quadratic domain it 
is clear from Application 1.2 on page 3 that the quotient field of Z[w,]| is F = Q(./n)—see 
Theorem 1.28 on page 45. 


Example 1.16 If F is any field, then the quotient field of the polynomial domain F'[z] is 
the field F(x) of rational functions in x. Moreover, the quotient field of Z is Q. 


Definition 1.11 — Norm-Euclidean Quadratic Domains 


A quadratic domain D with quotient field F' is said to be norm-Euclidean if 


for any p € F there exists a 0 € D such that |N(p—o)| < 1. (1.15) 


Now we demonstrate that the condition in Definition 1.11 is tantamount to the norm being 
a Euclidean function. 


Theorem 1.5 Let D be a quadratic domain. Then D is a Euclidean domain with respect 
to the norm function if and only if condition (1.15) holds. 
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Proof. Suppose that (1.15) holds. If a,8 € D with 6 4 0, then by Exercise 1.2 on page 6 
|N(a8)| = |N(a)||N(8)| = LN (@)| 

which is part (a) of Definition 1.9. It remains to show part (b) holds. If a,8 € D, then by 
(1.15) there exists a a € D such that 

|N(a/B—o)| <1. (1.16) 
Hence, if we let 

6=a-<af8, 
then 
|N(5)| = |N(a@ — o8)| = |N((a/B)8 — o8)| = |N(a/B — o)|-|N(B)| < |N(8)| 

by (1.16) which establishes (b). 
Conversely, if N is a Euclidean function on D, then for any p = a/8 € Q(./n), with 
a,3 € D, we have by part (b) of Definition 1.9 that there exist y,d € D such that 


a=y78+6 with N(d) < N(). 


Qa a— 7p ) 
N 3-1) =¥( =" (5) <1. 
( B B B 
This establishes (1.16) and so the entire result. 
The following turns out to be one of two possible domains Z[,/n] which is norm-Euclidean 
for n a negative squarefree integer and we look at the positive case as well. (Note that the 


other n < 0 for which we get Euclidean domains are those of the form Z[(1 + ./n)/2]—see 
Theorem 1.28.) 


Therefore 


Example 1.17 We show that 
b(at bi) =a? +b? = N(a+ bi) 
is a Euclidean function on the Gaussian integers a + bi € Z[i] = D using Theorem 1.5. 
To see that D is norm-Euclidean, select p = q+ ri € Q(t). We must find o =a+bi € D 
with 
\(q—a)? + (r—6)?| <1. 
This is accomplished by choosing: 
a = Ne(q) and b = Ne(r) where Ne(x) = |%+ 1/2] for any x ER. 


It can be shown that the only other squarefree n < 0 for which Z[,/n] is norm-Euclidean 
is for n = —2. Indeed the a,b chosen above for n = —1 will work for n = —2 as well. If 
we allow for w,, as defined in Application 1.2 on page 3, then Z[(1 + ./n)/2] for squarefree 
nm <0 is norm Euclidean if and only if 


n €{- 3, -7,-11} 


—see [54, Theorem 1.15, p. 34]. 
The case for positive D is also settled due to the efforts of several mathematicians culmi- 
nating in the complete solution in the middle of the last century. The positive squarefree 


integers n for which Z[w,,] is norm-Euclidean are given as follows—see [54, Remark 1.19, 
Theorem 1.21, p. 50): 


n € {2,3,5,6, 7, 11, 13, 17, 19, 21, 29, 33, 37, 41, 57, 73}. 
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Remark 1.8 It can be shown that Theorem 1.4 on page 10 generalizes to any Euclidean 
domain. In other words, there always exist gcds for elements in Euclidean domains. This 
comes from the verifiable fact that in a Euclidean domain D with respect to a Euclidean 
function ¢, we may select any a = ao, B = Bo € D with apo 4 0 and {o Jao and recursively 
define ay = ort) + Vi with (75) < 0(8;), where ay = Bj-1 and B; = Vj-1- The smallest 
n €N such that y, = 0 yields 7,_1 as the gcd of a and 8—see [54, Theorem 1.14, p. 33]. 


Example 1.17 is an example of a more general phenomenon, namely that the converse of 
Theorem 1.1 on page 5 always holds for Euclidean domains. 


Theorem 1.6 — Euclidean Domains Are UFDs 


If D is a Euclidean domain then a € D is irreducible if and only if a is prime. 
Proof. First, we establish that D is a factorization domain. By part (a) of Exercise 1.18, 
(a) = o(1p) if and only if a € Up. 


In this case a is vacuously a product of irreducible elements. Hence, we may use induction 
on ¢(a). By Exercise 1.21, d(1p) < ¢(a). Assume that a ¢ Up, and that any 6 € D with 
(8) < (a) has a factorization into irreducible elements. If a is irreducible, we are done. 
Assume otherwise. Then a = $y for 8,y € Dand £,y ¢ Up. Thus, by property (a) of 
Euclidean domains given in Definition 1.9, (8) < ¢(a), and ¢(y) < ¢(a). By part (b) of 
Exercise 1.18, 


o(y) # O(a), and (8) # g(a). 


Hence, ¢(8) < ¢(a) and ¢(y) < ¢(a) so, by the induction hypothesis, both 6 and y have 
factorizations into irreducibles. Thus, so does a. We have shown that D is a factorization 
domain. 

In view of Theorems 1.1 on page 5 and 1.2 on page 7, we need only show that irreducibles 
are primes. Suppose that aly for some 8,y € D. If af 6, then given the irreducibility of 
a, the only common divisors of a and @ in D are units. In particular, 1p is a ged of a and 
B. By Exercise 1.19, there exist 0,6 € D such that 1p = 0a+ 066. Therefore, 


y¥=oay+ 6By. 


Since a|G7, then aly, so a is prime. 


Thus, via Example 1.17 we have the solution for squarefree D. 


Corollary 1.1 If n € Z is squarefree, then Z[w,] is a norm-Euclidean domain if and only 
if 


Wel= T0374, 9.3506. 7, 11, 13,17, 19 91) 90, 33,97 41, Brea 


The following is immediate from Theorem 1.6 and is implicit in the header thereof. However, 
the converse of the following result fails to hold—see Exercise 1.25. 


Corollary 1.2 If D is a Euclidean domain, then D is a UFD. 
Exercises 


1.18. Establish the following facts concerning Euclidean functions ¢ on an integral domain 
D, introduced in Definition 1.9 on page 10. 
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1.20. 


1.22. 


1.23. 
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(a) Ifa~ B then $(a) = 9(8). 

(b) Ifa | G6 and $(a) = 6(8) then a~ 8B. 
(c) a € Up if and only if ¢(a) = (1p). 
(d) ¢(a) > $(0) for all nonzero a € D. 


. With reference to Exercise 1.17 on page 6, prove that any common divisor y of a and 


8, where a,(@ are elements of a Euclidean Domain D, may be written in the form 
y=o0a+ 66 
for some 0,6 € D. 


Prove that condition (a) in Definition 1.9 on page 10 is equivalent to the condition 
(c) Ifa | B for a,8 € D, with B £0, then d(a) < ¢(8). 


. Prove that a Euclidean domain D with Euclidean function ¢ satisfies ¢(1p) < (a) 


for all nonzero a € D. 
If a € D, a UFD, and |N(q)| is prime, show that a is prime in D. 


Either provide a counterexample to, or prove the converse of the statement in Exercise 
1.22. 


. Prove that the condition in Definition 1.11 on page 11 is tantamount to the condition: 


Given a,8 € D with 6 #0, there exist 0,6 € D with a = Bo+6 and |N(6d)| < |N(8)]. 


. An integral domain D is said to be an almost Euclidean domain provided that: there 


exists a function ¢: Dts NU {0} called an almost Euclidean function, such that 


(a) (0) = 0 and g(a) > 0 for a £0 in D. 
(b) If 6 is a nonzero element of D then ¢(a@8) > (a) for all a € D. 
(c) For any a,3 € D with 6 4 0, one of the following holds. 

(i) There exists a y € D such that a = 6+. 

(ii) There exist x,y € D such that 0 < d(ax + By) < (8). 


Prove that an almost Euclidean domain is a UFD. 


(This topic was introduced by Campoli [9]. With reference to our discussion 
herein, he produced integral domains, such as his example 


Z{(-1— v—-19)/2] 


that are UFDs which are not Euclidean domains. Campoli called his example 
“almost Euclidean.” This resulted in the production of countereramples to the 
converse of Corollary 1.2 on the previous page. Later Greene [25] showed that 
the conditions given above for an almost Euclidean domain are equivalent to being 
a “Principal Ideal Domain” (PID) which we will study in §1.5 and revisit this 
topic—see Exercises 1.47-1.48 on page 34. It turns out that Euclidean domains 
are PIDs which in turn are UFDs. However, neither converse holds. Examples 
of UFDs that are not PIDs are the hardest to produce and hence the above delin- 
eation. More recently, such as in [31], almost Euclidean spaces have been used 
for applications in complexity theory and error-correcting codes. 
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1.3. Ideals 


Intelligence without ambition is a bird without wings. 
Salvador Dali (1904—1989), Spanish painter 


In this section we set the stage for the introduction of two types of domains based upon the 
theory of ideals which will elevate the factorization debate from elements to ideals. This 
allows us to witness the influence of Dedekind and others on the development of algebraic 
number theory. Some of the following is adapted from [54]. 


Definition 1.12 — Ideals 


An R-ideal is a nonempty subset IJ of a commutative ring R with identity having the 
following properties. 


(a) Ifa,8 ET, thna+ el. 
(b) IfaeTandreé R, then rae I. 


Remark 1.9 It is inductively clear that Definition 1.12 implies that if a1,a2,...,a@n, € I 


for any n € N, then ray + rga2 +++: +Tnay € I for any r1,172,..-,% € R. Moreover, if 
1eéTJ, then J = R. Also, if we are given a set of elements {a1,Q2,...,@,} in an integral 
domain R, then the set of all linear combinations of the a; for 7 = 1,2,...,n 


n 


SOG ere Ritor f= 1,2, 10.4m 


j=l 
is an ideal of R denoted by (a1, @2,...,Q@n). In particular, when n = 1, we have the 
following. 

Definition 1.13 — Principal and Proper Ideals 


If D is an integral domain and I is a D-ideal, then I is called a principal D-ideal if there 
exists an element a € I such that I = (a), where a is called a generator of I. If I 4 D, 
then I is called a proper ideal. 


Example 1.18 Let n € Z and set nZ = {nk : k € Z}, which is an ideal in Z and 
nZ = (n) = (—n) is indeed a principal ideal. Moreover, it is a proper ideal for all n 4 +1. 


Example 1.19 In D = Z([i], (2) and (3) are proper principal ideals. Moreover, the latter is 
an example of a special type of ideal that we now define—see Example 1.20 on the following 


page. 


Definition 1.14 — Prime Ideals 
If D is an integral domain, then a proper D-ideal P is called a prime D-ideal if it satisfies 
the property that whenever af € ?, for a,8 € D, then either a € P or BE P. 


In order to discuss any more features of ideal theory, we need to understand how multipli- 
cation of ideals comes into play. 
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Definition 1.15 — Products of ideals 


If D is an integral domain and I,J are D-ideals, then the product of J and J, denoted by 
IJ, is the ideal in D given by 


IJ={réeD:r=)>_ ajh; where n € N,anda,; € 1,8; € J forl <j <n}. 
j=l 


Theorem 1.7 — Criterion for Prime Ideals 


If D is an integral domain and J is a proper D-ideal, then J is a prime D-ideal if and only 
if the following property is satisfied: 


for any two D-ideals J, K, with Jk CI, either JC lTor Kk Cl. (1.17) 


Proof. Suppose that (1.17) holds. Then if a,6 € D such that af € I, then certainly 
(a8) = (a)(6) C I, taking J = (a) and K = (8) in (1.17), which therefore implies that 
(a) CI or (8) CTI. Hence, a € I or 6 € I. We have shown that (1.17) implies J is prime. 
Conversely, suppose that I is a prime D-ideal. If (1.17) fails to hold, then there exist D- 
ideals J, K such that JK CI but K ZIandJ ZI. Letae J witha¢g land GE K with 
B € 1, then af € I with neither of them being in J, which contradicts Definition 1.14 on 
the previous page. Hence, (1.17) holds and the result is secured. 


Now we prove a result that links the notion of prime element and prime ideal in the principal 
ideal case. 


Theorem 1.8 — Principal Prime Ideals and Prime Elements 


If D is an integral domain and a € D is a nonzero, nonunit element, then (a) is a prime 
D-ideal if and only if a is a prime in D. 


Proof. Suppose first that (@) is a prime D-ideal. Then for any 6,y € D such that a | BY, 
By € (By) © (a). Since (a) is a prime D-ideal, then 8 € (a) or 7 € (a) by Definition 1.14. 
In other words, a | Bora | y, namely a is a prime in D. 

Conversely, suppose that a is prime in D. If 6,7 € D such that Gy € (a), then there exists 
an r € D with By =ar. Since a is prime, then a | Bora | y. Ifa | 6, there is an s € D 
such that 8 = as, so 8 € (a). If a| 7, there is a t € D such that 7 = at, so 7 € (a). We 
have shown that (@) is a prime D-ideal by Definition 1.14, which completes the proof. 


Example 1.20 In Example 1.19 on the preceding page, (2) and (3) were considered as 
principal ideals in the Gaussian integers. By Exercises 1.26—1.27 on page 19, 3 is a prime in 
Zi], but 2 is not. Therefore, by Theorem 1.8, (3) is a prime ideal in the Gaussian integers 
but (2) is not. 


Now that we may look at products of ideals, we may we look at the notion of division in 
ideals in order to link this with elements and primes. 


Definition 1.16 — Division of Ideals 


If D is an integral domain, then a nonzero D-ideal I is said to divide a D-ideal J if there 
is another D-ideal H such that J = HI. 


The following shows that division of ideals implies containment. 
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Lemma 1.1 — ‘To Divide is to Contain 
If D is an integral domain and J, J are D-ideals, with I | J, then J CI. 


Proof. Since I | J, then by Definition 1.16, there is a D-ideal H such that J = TH. However, 
by Definition 1.12 on page 15, J = IH CIR CT, as required. 


Corollary 1.3 Suppose that D is an integral domain and I is a D-ideal satisfying the 


property: 
whenever J | JK for D-ideals J, K, we have I'| J or I| K. (1.18) 


Then J is a prime D-ideal. 


Proof. Suppose that I | JK, then by Lemma 1.1, JK C I, and (1.18) implies that either 
J CTlIor kK CTI. Thus, by Theorem 1.7, J is a prime D-ideal. 


The question now arises as to the validity of the converse of Lemma 1.1 in certain domains. 
In order to discuss this topic, we must prepare the stage with some essential topics. First 
of all there are types of ideals which are core to the theory. 


Definition 1.17 — Maximal Ideals 


In an integral domain D, an ideal M is called maximal if it satisfies the property that 
whenever M CI C D, for any D-ideal I, then either J = D or T= M. 


The next concept is necessary to prove our first result about maximal ideals. First note 
that if J, J are R-ideals, then J + J is necessarily an R-ideal since for any r € R,a € TJ, 
BEJ,r(a+ 8) €1+J by Definition 1.12 on page 15. We formalize this in the following. 


Definition 1.18 — Sums of Ideals Are Ideals 


If I, J are ideals in D, a commutative ring with identity, then J+J = {a+ 8:a€I,6 € J}, 
is an ideal in D. 


We use the above to prove our first result that we need to link maximality with primality. 


Theorem 1.9 — Quotients of Prime Ideals Are Integral Domains 


If D is an integral domain, then a D-ideal P is prime if and only if D/P is an integral 
domain. 


Proof. Suppose that P is a prime D-ideal. Then D/P is a commutative ring with multi- 
plicative identity 1p + P and additive identity Og + P. We must verify that D/P has no 
zero divisors. If a,8 € D with (a+ P)(G +P) =0rR +P =P, then aB+P=?, so ase P. 
Since P is prime, then either a € P or B € P. In other words, either a+ P = 0g + P or 
B+P=0r+?. We have shown that D/P has no zero divisors. 


Conversely, if D/P is an integral domain, then a8 € P implies that 
(a+ P)\(B+P) =aB+P =0R+P. 


Thus, having no zero divisors in D/P, either a+ P =0rg+P or B+P =0rR4+ P. In other 
words, either a € P or 8 € P, so P is a prime D-ideal. 


Now we link prime ideals with maximal ones. 
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Theorem 1.10 — Maximal ideals Are Prime 

If D is an integral domain, then every nonzero maximal D-ideal is prime. 

Proof. Suppose M # (0) is a maximal D-ideal, and M | IJ for some D-ideals I, J, with M 
dividing neither factor. By Lemma 1.1 on the preceding page, there exist a € J and 6 € J 


such that 
M | IJ | (a)(8) 


with M dividing neither (a) nor (G), namely a ¢ M and 6 ¢ M. Therefore, by Defini- 
tion 1.18 on the previous page, M + (a) and M + (() are D-ideals, both of which properly 
contain M,so M 4 D. Hence, by the maximality of M, we have, 


M+(a)=D=M +(£). 
Therefore, 
Mc D=D?=(M +(a))(M +(8)) © M? + (a)M +(8)M + (a)(8)M CM, 


a contradiction. We have shown that either 14 | (a) or M | (8). Therefore, by Corollary 1.3 
on the preceding page, M is prime. 


The next result tells us when an ideal is maximal with respect to quotients in integral 
domains. 


Theorem 1.11 — Fields and Maximal ideals 
If D is an integral domain, then M is a maximal D-ideal if and only if D/M is a field. 


Proof. First we need the following fact. 
Claim 1.2 D is a field if and only if the only ideals in D are (0) and D. 


If D is a field and I ¥ (0) is a D-ideal, then there exists a nonzero element a € I. However, 
since D is a field, then there exists an inverse a~! € D of a. By Definition 1.12 on page 15, 
aa-t=1p €1,sol=D. 

Conversely, suppose that the only D-ideals are (0) and D. If a € D is nonzero, let 


(a) =aD =I. 


By hypothesis, J = D. Thus, there exists a G € D such that Ga = 1p, so a is a unit. 
However, a was chosen as an arbitrary nonzero element in D, so D is a field. This is Claim 
12, 


Suppose that D/M is a field for a given D-ideal M. If M CI C D for a D-ideal J, then 
I/M is an ideal of D/M, so by Claim 1.2, I[/M = (0) or I/M = D/M. In other words, 
either J = D or I = M, namely M is maximal. 

Conversely, if M is maximal, then by Theorem 1.10, either MM = (0) or M is prime. If 
M = (0), then D/(0) & D is a field by Claim 1.2, given that (0) is maximal, implying that 
D has no proper ideals. If M is prime, then by Theorem 1.9 on the preceding page, D/M 
is an integral domain. Thus, it remains to show that all nonzero elements of D/M have 
multiplicative inverses, namely that ifa+M 4 M, then a+M has a multiplicative inverse 
in D/M. Givena+M #4 M, then a ¢ M. Thus, M is properly contained in the ideal 
(a) + M. Hence, (a) + M = D. In other words, 


lp =m+ra for some m € M andre D. 


Therefore, 1p —ra =m € M,solp+M =ra+M =(r+M)(a+M), namely r+ M is 
a multiplicative inverse of a+ M in D/M, so D/M is a field. 
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Example 1.21 If D=Z/nZ, where n €N, then by Theorem 1.11, Z/nZ is a field if and 
only if nZ is maximal. Later we will see that Z, being a special case of the ring of integers 
of a number field, always satisfies the property that all prime ideals are maximal—see 
Definition 1.23 on page 25 and Theorem 1.26 on page 42. Hence, in conjunction with 
Theorem 1.10, we have 


Z/nZ is a field if and only if nZ is prime. 


Example 1.22 Let F bea field, r € F is a fixed nonzero element, and 
I= {f(x) € Flax]: f(r) = 0}. 


We now demonstrate that I is a maximal ideal in Fa]. First, we show that I is indeed an 
ideal in Fla]. If g(x) € Fla], then for any f(x) € I, g(r)f(r) = 0, so g(x) f(a) € I, and 
clearly f(r) + h(r) = 0 whenever f(x), h(x) € I, which shows that I is an F'[a]-ideal. If we 
define ¢ to be the map 

6: Fle] Flal/1, 


given by 


then an easy check shows that I = ker(¢)—see (A.3) on page 325 in Appendix A, from 
which it follows that J is maximal, as 


F & Fla]/1. 


In §1.4, we will use ideal theory developed herein to introduce and explore two distinguished 
types of domains that set the stage for Dedekind’s masterpiece contribution presented in 
§1.5. This makes way for the foundational building bricks of algebraic number theory in §1.6, 
where algebraic numbers and numbers fields as generalizations of Z and Q are introduced. 
This provides the springboard to the balance of the text that explores this magnificent 
edifice of mathematics. 


The last section of this chapter, $1.7, is a motivator for Chapter 2 by looking in detail at 
the least nontrivial extension of Q, namely the quadratic field case, which builds upon the 
quadratic domains introduced and discussed in §1.2. 


Exercises 


1.26. Prove that any prime p € Z with p = 3(mod 4) is a prime in Z/i]. 
(By Corollary 1.1 of Theorem 1.6 on page 13 it only needs to be shown that p is 
irreducible.) 

1.27. Prove that if a € Z[t] and Nr(a) = p, where p is prime in Z, then a is a prime in Z/i] 
but p is not a prime in Z/i] and p = 1(mod 4) or p= 2. 

1.28. Prove that in an integral domain D with a,8 € D nonzero, as ideals (a) = (8) if and 
only if aB~! € Up. 


1.29. For some indexing set J, let R be a ring and let {R; : 7 € J} be any set of subrings of 
R. Prove that Nj;e7R,; is a subring of R. Also, show that if 


R, CR. C--- R; ¢ vee, 


then Uje7R; is a subring of R. 
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1.4 Noetherian and Principal Ideal Domains 


Whether you think you can, or you can’t—you are right. 
Henry Ford (1863-1947), American car manufacturer 


In this section, we use our knowledge of ideals to proceed to build the foundations of 
algebraic number theory by investigating two kinds of domains that will lead us into the 
building bricks of algebraic number fields. The following is crucial in the sequel. Some of 
the following is adapted from [54]. 


Definition 1.19 — Ascending Chain Condition (ACC) 


An integral domain R is said to satisfy the ascending chain condition (ACC) if every chain 
of R-ideals I, C Ig C ---I, C +--+ terminates, meaning that there is an np € N such that 
I, = In, for all n > no. 


Remark 1.10 An equivalent way of stating the ACC is to say that R does not possess 
an infinite strictly ascending chain of ideals. 

The above is a segue to the following important notion that will carry us forward toward 
our goals—see Biography 1.1 on page 23. 

Definition 1.20 — Noetherian Domains 

An integral domain R possessing the ACC is called a Noetherian Domain. 


For the following, the reader is reminded of the general notion of finite generation given in 
Definition A.7 on page 324 in Appendix A. Also, see Remark 1.9 on page 15. 


Lemma 1.2 — Finite Generation and Noetherian Domains 


If R is an integral domain, then R is a Noetherian Domain if and only if every R-ideal is 
finitely generated. 


Proof. Suppose that every R-ideal is finitely generated. Let 


ly Cig Ce. Ch Ce: 


be an ascending chain of ideals. It follows from Exercise 1.29 on the previous page that 
I = UJ; is an R-ideal, and since any R-ideal is finitely generated, then there exist a; € R 
for 7 = 1,2,...,d € N such that 


[= (a4, Q2,...,Qa). 


Therefore, for each 7 = 1,2,...,d, there isa k; with a; € Ip,. Let n = max{ky,ko,..., ka}. 
Then since J, C I and Ix, C In, given that k; <n for each such j, we have (a1, a2,...,Q@a) © 
I, which implies that J C I,. Hence, In, = UP2,J; and so I, = I; for each 7 > n. Since the 
chain terminates, R satisfies the ACC, so is a Noetherian domain. 

Conversely, suppose that R is a Noetherian domain. If J is an R-ideal that is not finitely 
generated, then I ¥ (0), so there exists a, € I with a; #0, and (a,) C I. Since I ¥ (a4), 
given that the former is not finitely generated, then there exists ag € I and a2 ¢ (a1) so 
we have 

(a1) Cc (a1, Q2) Gl. 


Continuing inductively in this fashion, we get the strictly ascending chain of ideals, 


(a1) C (a1, a2) (roe ae (Q1,Q2,---,An) C CT, 


contradicting that R is a Noetherian domain. Hence, every R-ideal is finitely generated. 
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Corollary 1.4 Let R be a Noetherian domain. Then every nonempty subset of R-ideals 
contains a maximal element. 


Proof. Let J be the set of ideals with the property that for every ideal I of J, there exists 
an ideal J of J with J c J. If J # @, then by its definition we may construct an infinite 
strictly ascending chain of ideals in J, contradicting Lemma 1.2. This is the result. 


Immediate from Corollary 1.4 is the following result. 


Corollary 1.5 Ina Noetherian domain R, every proper R-ideal is contained in a maximal 
R-ideal. 


We need the following concept that is intimately linked to the notion of a UFD, especially 
when we are dealing with Dedekind domains—see Definition 1.8 on page 7. 


Definition 1.21 — Principal Ideal Domain (PID) 


An integral domain R in which all ideals are principal is called a principal ideal domain, or 
PID. 


Theorem 1.12 — PIDs and Noetherian Domains 
If R is a PID, then R is a Noetherian domain. 


Proof. If we have a nested sequence of R-ideals 
(a1) € (a2) © -+-(aj) G+, 
then it follows from Exercise 1.29 that U?2,(a;) is an R-ideal. Thus, since R is a PID, there 
exists an a € R such that Uf2,(a;) = (@), so there exists an n € N such that a € (ap). 
Therefore, 
(aj) = (an) = (a) 


for all 7 > n. Thus, the ACC condition of Definition 1.19 is satisfied and R is a Noetherian 
domain. 


The following strengthens Corollary 1.2 on page 13 and puts Exercise 1.25 on page 14 into 
clearer focus. 


Corollary 1.6 A Euclidean domain is a PID, and so is Noetherian. 


Proof. If D is a Euclidean domain, then D has a Euclidean function ¢ by Definition 1.9 on 
page 10. Let I be a nonzero D-ideal and set 


8 = {d(a): a€I,a £0}. 


Given that I 4 (0), 8 4 @. Using the Well-Ordering Principle—see page 340—S has a least 
element ¢(3) where 6 € I, 8 £0. Let y € I be arbitrary. Then by part (b) of Definition 
1.9, there exist r,q € D with 


y= q8+r with d(r) < d(f). 


By Definition 1.12 on page 15, r = y— 8 € I, so by the minimality of 6(3), we must have 
that r = 0. Therefore, y = g3, which implies, since 7 was arbitrarily chosen, that I = (). 
We have shown that every ideal of D is principal (given that the zero ideal is principal as 
well), so D is a PID. By Theorem 1.12, D is therefore Noetherian. 
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Remark 1.11 Note that via Exercise 1.25, Corollary 1.6 is more general than Corollary 
1.2 since there are UFDs that are not PIDs and the following shows that Corollary 1.2 
follows from Corollary 1.6. Also, see the related Exercise 1.47 on page 34 


Theorem 1.13 — PIDs and UFDs 
If R is a PID then R is a UFD. 


Proof. Let § be the set of all a € R such that a is not a product of irreducible elements. 
If S £ @, then by Corollary 1.4 on the preceding page, via Theorem 1.12, 5 has a maximal 
element m. Thus, (m) is a proper ideal (since a unit is vacuously a product of irreducible 
elements by Definition 1.6 on page 4). Therefore, (m) is contained in a maximal R-ideal 
(M) for some M € R, by Corollary 1.5 on the preceding page, again via Theorem 1.12. 
Thus, M@ | m and (M) 4 (m) by Theorem 1.10 on page 18. Since M is a product of 
irreducible elements, there exists an @ | m such that q@ is irreducible. Therefore, m = af 
for some 6 € R. If 6 is a unit, then m is irreducible since associates of irreducibles are 
also irreducible, a contradiction. Hence, 6 is not a unit. If (G) ¢ 8, then 8 is a product of 
irreducibles, and so is m, a contradiction. Thus, (6) € 8. However, 6 | m, so (m) € (8) by 
Lemma 1.1 on page 17. Also, (m) 4 (8) since a is not a unit, given that it is irreducible. 
Hence, (m) is properly contained in (G) C 8, a contradiction to the maximality of (m) in 
&,so 8 = ©. This establishes that all nonzero elements of R are expressible as a product of 
irreducible elements. 

We may complete the proof by showing that all irreducible elements are prime and invoke 
Theorem 1.2 on page 7. Suppose that r € R is an irreducible element and r | aB,a,B € R, 
with r not dividing a. Then by the irreducibility of r, we must have that r and a are 
relatively prime, namely R = (r) + (a), so there exist 51,52 € R such that lp =rs; + a52. 
Therefore, 

(8) = (rs18 + as28) C (r), 

since r | a@ implies that (r) D (a), so both rs; € (r) and as28 € (r). In other words, 
r | 8, so r is prime as required. 


Exercises 


1.30. In a commutative ring R with identity, an R-module M is defined to be Noetherian 
if every ascending chain of submodules of M terminates in the same sense as in 
Definition 1.19 on page 20. Prove that if N is a submodule of a Noetherian R-module 
M, then both N and M/N are Noetherian R-modules. 


1.31. With reference to Exercise 1.30, either provide a counterexample to the converse or 
prove that: if N is a submodule of an R-module M such that both N and M/N are 
Noetherian R-modules, then M is a Noetherian R-module. 


1.32. If R is a Noetherian ring, prove that any finitely generated R-module is Noetherian. 


1.33. Let D; be integral domains for 7 = 1,2 with D, € Dg. If D, is Noetherian and Dz is 
finitely generated as a D,-module, prove that D2 is a Noetherian domain. 


1.34. Prove that Z[,/n] is a Noetherian domain for any nonsquare integer n. 


1.35. A commutative ring R with identity is said to satisfy the descending chain condition, 
denoted by DCC, on ideals if every sequence [) D [2 D --- D Ij D--- of R-ideals 
terminates. In other words, there exists an n € N such that J; = J, for all 7 > n. 
Prove that R satisfies the DCC if and only if every nonempty collection of ideals 
contains a minimal element. (Rings of the above type are called Artinian rings.) 


1.4. 


Noetherian and Principal Ideal Domains 


Biography 1.1 Emmy Amalie Noether (1882-1935) was born in Erlangen, 
Bavaria, Germany on March 23, 1882. She studied there in her early years 
and, in 1900, received certification to teach English and French in Bavarian 
girls’ schools. However, she chose a more difficult route, for a woman of that 
time, namely to study mathematics at university. Women were required to get 
permission to attend a given course by the professor teaching it. She did this 
at the University of Erlangen from 1900 to 1902, and passed her matricula- 
tion examination in Niirnberg in 1903, after which she attended courses at the 
University of Gottingen from 1903 to 1904. By 1907, she was granted a doctor- 
ate from the University of Erlangen. By 1909, her published works gained her 
enough notoriety to warrant an invitation to become a member of the Deutsche 
Mathematiker- Vereinigung, and in 1915, she was invited back to Gottingen by 
Hilbert and Klein. However, it took until 1919 for the university to, grudgingly, 
obtain her Habilitation,!:' and permit her to be on the faculty. In that year 
she proved a result in theoretical physics, now known as Noether’s Theorem, 
praised by Albert Einstein as a penetrating result, which laid the foundations 
for many aspects of his general theory of relativity. After this, she worked in 
ideal theory, developing ring theory, which turned out to be of core value in 
modern algebra. 


Her work Idealtheorie in Ringbereichen, published in 1921, helped cement this 
value. In 1924, B.L. van der Waerden published his work Moderne Algebra, 
the second volume of which largely consists of Noether’s results. Her most 
successful collaboration was in 1927 with Helmut Hasse and Richard Brauer 
on noncommutative algebra. She was recognized for her mathematical achieve- 
ments through invitations to address the International Mathematical Congress, 
the last at Zurich in 1932. Despite this, she was dismissed from her position at 
the University of Gottingen in 1933 due to the Nazi rise to power, given that 
she was Jewish. She fled Germany in that year and joined the faculty at Bryn 
Mawr College in the U.S.A. She died at Bryn Mawr on April 14, 1935. She was 
buried in the Cloisters of the Thomas Great Hall on the Bryn Mawr campus. 
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1.1 Habilitation is the highest academic qualification achievable in certain European and Asian countries. 
Typically Habilitation is earned after obtaining a research doctorate (Ph.D.), which is sufficient qualification 
for a senior faculty position at a university in North America. However, a Habilitation requires a professorial 
thesis, reviewed by and defended before an academic committee similar to that for a North American 
Ph.D., but the level of scholarship expected is usually much higher. In practice, for instance in Germany, 
a Habilitation is required to supervise doctoral students, a post that is known as Privatdozent and there 
are similarly termed appointments in other countries. After serving as Privatdozent, the next step is often 
appointment as a professor in the faculty in which the candidate sits. 


24 


1. Integral Domains, Ideals, and Unique Factorization 


Biography 1.2 Emil Artin (1898-1962) was born on March 3, Vienna, Austria 
in 1898. He served in the Austrian army in World War I, after which he entered 
the University of Leipzig. In 1921 he obtained his doctorate, the thesis of 
which was on quadratic extensions of rational function fields over finite fields. 
In 1923, he had his Habilitation, allowing him to become Privatdozent at the 
University of Hamburg—see Footnote 1.1 on the previous page. In 1925, he 
was promoted to extraordinary professor at Hamburg. In that same year, 
he introduced the theory of braids, which is studied today by algebraists and 
topologists. In 1928, he worked on rings with minimum condition, the topic 
of Exercise 1.35 on page 22, which are now called Artinian rings. In 1937, 
Hitler enacted the New Official’s Law, which enabled a mechanism for removing 
not only Jewish teachers from university positions but also those related by 
marriage. Since Artin’s wife was Jewish, although he was not, he was dismissed. 
In 1937, he emigrated to the U.S.A. and taught at several universities there, 
including eight years at Bloomingdale at Indiana University during 1938-1946, 
as well as Princeton from 1946 to 1958. During this time, in 1955, he produced 
what was, arguably, the catalyst for the later classification of finite simple 
groups, by proving that the only (then-known) coincidences in orders of finite 
simple groups were those given by Dickson in his Linear Groups. In 1958, 
he returned to Germany where he was appointed again to the University of 
Hamburg. Artin’s name is attached not only to the aforementioned rings, 
but also to the reciprocity law that he discovered as a generalization of Gauss’s 
quadratic reciprocity law. One of the tools that he developed to do this is what 
we now call Artin L-functions. He also has the distinction of solving one of 
Hilbert’s famous list of twenty-three problems posed in 1900—see Biography 3.4 
on page 94. 


He was an outstanding and respected teacher. In fact, many of his Ph.D. 
students such as Serge Lang, John Tate, and Max Zorn went on to major 
accomplishments. He also had an interest in astronomy, biology, chemistry, 
and music. He was indeed an accomplished musician in his own right, playing 
the flute, harpsichord, and clavichord. He died in Hamburg on December 20, 
1962. 
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1.5 Dedekind Domains 


Mathematics is the only instructional material that can be presented in an entirely 
undogmatic way. 

Max Wilhelm Dehn (1878-1952), German mathematician who 
introduced one of the first structured elucidations on topology 


§1.4 put us in a position to define a contribution by Dedekind —see Biography 1.3 on 
page 29. First we need the following notion. 


Definition 1.22 — Integral Over a Domain and Integral Closure 


If D C S where D and S are integral domains, then a € S is said to be integral over D if 
there exists a d € N, and a polynomial 


f(z) =29 + Bait? | +--+ Bix + Bo with B; € D for j =0,1,...,d-1 
such that f(a) = 0. 


D is said to be integrally closed in S if each element of S that is integral over D is actually 
in D. 


Example 1.23 The integral domain Z is integrally closed in Q, but not in C since ~—1 € C 
is integral over Z. 


The following will prove to be a useful tool in §1.6, and is of interest in its own right. The 
reader should solve Exercise 1.40 on page 33 in anticipation of the proof. 


Theorem 1.14 — Towers of Integral Domains 


If RCS CT is a tower of integral domains with S' integral over R and ¢t € T integral over 
S, then ¢ is integral over R. 


Proof. Given that t € T is integral over S, there exist 51, 52,...,5n € S such that 


t? + Spit? +--+ + sit +59 = 0. 


Hence, we have shown that t is integral over Riso, $1,...,5n]. Since s; € S is integral over 
R for j = 0,1,...,n —1, then by part (c) of Exercise 1.40, R[so,51,...,8n] is a finitely 
generated R-module. Since ¢ is integral over R[so, $1,.--, Sn], then the same exercise part 


(d) shows that 
R[s0, $1,-- +5 Sn] [t] = Riso, $1,.--, Sn, ¢] 


is a finitely generated R-module. Hence, by part (e) of the exercise, ¢ is integral over R. 


Now we bring in Dedekind’s ideas. 

Definition 1.23 — Dedekind Domains 

A Dedekind Domain is an integral domain D satisfying the following properties. 
(A) Every ideal of D is finitely generated. 

(B) Every nonzero prime D-ideal is maximal. 


(C) D is integrally closed in its quotient field F’. 
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Remark 1.12 Condition (C) says that if a/@ € F is the root of some monic polynomial 
over D, then a/8 € D, namely 6 | ain D. Also, note that by Lemma 1.2 on page 20, 
Condition (A) may be replaced by the condition that R is a Noetherian domain. 


Now we aim at the main goal of this section, which is a unique factorization theorem for 
ideals. To this end, we first settle conditions for which the converse of Lemma 1.1 on page 17 
holds. We require a more general notion of ideal in order to proceed. 


Definition 1.24 — Fractional Ideals 


Suppose that D is an integral domain with quotient field F. Then a nonempty subset I of 
F is called a fractional D-ideal if it satisfies the following three properties. 


1. For any a,8 €l,a+ 6 el. 
2. For anya él andre D,rae I. 


3. There exists a nonzero y € D such that yI C D. 


When I C D, we call I an integral D-ideal (which is the content of Definition 1.12 on 
page 15) to distinguish it from the more general fractional ideal. 


Remark 1.13 It is immediate from Definition 1.24 that if I is a fractional D-ideal, then 
there exists a nonzero y € D such that yJ = J where J is an integral D-ideal. 


Example 1.24 Let D=Z, and F =Q. Then the fractional D-ideals are the sets 
Ig ={qZ:qe€ Qth. 


Since qZ = (—q)Z, we may restrict attention to the positive rationals Q* without loss of 
generality. Also, 
Lg Lg. = 1GgQZ = Lg q2: 
We have the isomorphism 
S={I,:¢eQyeQ, 
as multiplicative groups. The unit element of S is Z and the inverse element of J, € § is 
(I,)~! = q"'Z. (See Exercise 1.43 on page 33.) 


Example 1.24 motivates the following. 


Theorem 1.15 — Inverse Fractional Ideals 

If D is an integral domain with quotient field F’, and I is a fractional D-ideal, then the set 
I+ ={a¢€F:aI CD} 

is a nonzero fractional D-ideal. 

Proof. If a,8 € I~!, then aI C D and BI C D, so 
(a+ 8)I Cal+6ICD, 


which implies a+ 8 € I7!. Ifa € I~! andr € D, al C D, then ral C D, from which 
it follows that ra € I~!. Lastly, let y be a nonzero element of J. Then for any a € I~}, 
al C D, so in particular, ya € D. Hence, yI~! C D. This satisfies all three conditions in 
Definition 1.24. 
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Definition 1.25 — Invertible Fractional Ideals 
In an integral domain D a fractional D-ideal I is called invertible if 


II=D, 
where J~!, given in Theorem 1.15, is called the inverse of I. 
Now we may return to Dedekind domains and the pertinence of the above to them. 


Theorem 1.16 — Invertibility in Dedekind Domains 


If D is a Dedekind domain, then every nonzero integral D-ideal is invertible. 


Proof. Since D is a Dedekind Domain, then every D-ideal J is finitely generated, so for 
I # (0), there are a; € D for 1 < j < d such that I = (a1,a2,...,aa). If d = 1, then 
I~! = (aj;") and IJ~! = D. Now the result may be extrapolated by induction, and the 
result is established. 


Corollary 1.7 — To Divide is the Same as to Contain 
If D is a Dedekind domain, and J, J are D-ideals, then 


I | J if and only if J CJ. 
Proof. In view of Lemma 1.1, we need only prove one direction. Suppose that 
JC. (1.19) 
Now let H = I~1J, in which case J = IH where H is a D-ideal since by (1.19), 


FAUsef TSP, 


where the equality follows from Theorem 1.16. Thus, I | J, and we have secured the result. 


As a consequence of Corollary 1.7, we see that a prime D-ideal P in a Dedekind domain D 
satisfies the same property as prime elements in Z—see Example 1.9 on page 4. 


Corollary 1.8 Suppose that D is a Dedekind domain. Then P is a prime D-ideal if it 
satisfies the property that for any D-ideals I, J, 


P| IJ if and only if P| I or P| J. 


Proof. By Corollary 1.7, P | IJ if and only if [J C P and the latter holds, by (1.17) on 
page 16, if and only if J C P or J C P, so applying Corollary 1.7 to the latter we get the 
result. 


We have the following result that mimics the same law for nonzero elements of Z. 


Corollary 1.9 — Cancellation Law for Ideals in Dedekind Domains 


Let D be a Dedekind domain. If I, J, are D-ideals with I 4 (0), and IJ C IL, then 
JCL, 


Proof. lf IJ = IL, then by Theorem 1.16, 


J=DJ=INIIJCIuL=DL=t, 


as required. 


Now we are ready for the promised unique factorization result. 
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Theorem 1.17 — Unique Factorization of Ideals 


Every proper nonzero ideal in a Dedekind domain D is uniquely representable as a product 
of prime ideals. In other words, any D-ideal has a unique expression (up to order of the 
factors) of the form 

Li PT Peer Po, 


where the P; are the distinct prime D-ideals containing J, and a; € N for j = 1,2,...,n. 


Proof. First we must show existence. In other words, we must show that every ideal is 
indeed representable as a product of primes. Let § be the set of all nonzero proper ideals 
that are not so representable. 

It follows from Remark 1.12 on page 26 and Corollary 1.4 on page 21 that if 8 4 @, then 
& has a maximal member M. By assumption, M cannot be prime and hence not maximal 
in D, so contained in some maximal prime D-ideal P. Also, P is maximal by part (B) of 
Definition 1.23. Hence, 

Dec ut, 


which implies 


MCMP1CMM =D, 


where the equality follows from Theorem 1.16 on the previous page. We have shown that 
MP" is an integral D-ideal. If P~-'M = M, then 


PP-1M = PM CP, 


where the latter inclusion comes from the fact that P is an ideal. Hence, M = P by 
the maximality of P, a contradiction to M € 8. Thus, P-'M #4 M, so M c P-!M, 
namely P~'M is an integral ideal not in 8. This means there are prime ideals P; for 
j =1,2,...d € N such that 

P-1M =2P1Po-+-Pa, 


which implies 
M=DM=PP 1M = PP1Po--- Pa, 


contradicting that M € 8. We have shown & = ©, thereby establishing existence. It remains 
to show uniqueness of representation. 
Let P; and Q; be (not necessarily distinct) prime D-ideals such that 


Pe 3 DS Oy x50 Oe (1.20) 
Hence, 
Pio Bieta, 
so 9; C Py for some j = 1,2,...,s. Without loss of generality, we may assume that 7 = 1, 


by rearranging the 9; if necessary. However, by condition (B) of Definition 1.23, P; = Q). 
Multiplying both sides of (1.20) by Py, we get 


Po. Py = Oo? 05. 


Continuing in this fashion, we see that by induction, r = s and P; = 9; forl <j <s=r. 
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Biography 1.3 Julius Wilhelm Richard Dedekind (1831-1916) was born in 
Brunswick, Germany on October 6, 1831. There he attended school from the 
time he was seven. In 1848, he entered the Collegium Carolinum, an educational 
bridge between high school and university. He entered Gottingen at the age 
of 19, where he became Gauss’ last student, and achieved his doctorate in 
1852, the topic being Eulerian integrals. Although he taught in Gottingen 
and in Ztrich, he moved to Brunswick in 1862 to teach at the Technische 
Hochschule, a technical high school. In that year he also was elected to the 
Gottingen Academy, one of many honours bestowed on him in his lifetime. He 
maintained this position until he retired in 1894. Dedekind’s creation of ideals 
was published in 1879 under the title Uber die Theorie der ganzen algebraischen 
Zahlen. Hilbert extended Dedekind’s ideal theory, which was later advanced 
further by Emmy Noether. Ultimately this led to the general notion of unique 
factorization of ideals into prime powers in what we now call Dedekind domains. 
Another of his major contributions was a definition of irrational numbers in 
terms of what we now call Dedekind cuts.He published this work in Stetigkeit 
und Irrationale Zahlen in 1872. He never married, and lived with his sister 
Julie until she died in 1914. He died in Brunswick on February 12, 1916. 


Now we look at PIDs and UFDs in the case of Dedekind domains. 


Theorem 1.18 — UFDs are PIDs for Dedekind domains 
If R is a Dedekind domain, then R is a UFD if and only if R is a PID. 


Proof. In view of Theorem 1.13 on page 22, we need only prove that R is a PID when it is 
a UFD. Therefore, if there exists an R-ideal that is not principal, then by Theorem 1.17, 
there exists a prime R-ideal P that is not principal. Let 8 consist of the set of all R-ideals I 
such that PJ is principal. By Exercise 1.38 on page 33, 8 4 @. By Remark 1.12 on page 26 
and Corollary 1.4 on page 21, § has a maximal element M. Let 


PM = (a). 


If a = By where @ € P is irreducible, then (6) = PJ where J is an R-ideal such that J | M, 
so J D M. By the maximality of M, we have J = M, so y is a unit and a is irreducible. 
Since P is not principal, there is a nonzero 6 € P — (a), and since M = (a) would imply 
that P = R, there is a nonzero o € M — (a). Thus, do € PM C (a), so a | da. However, a 
divides neither 6 nor 0, so a is not prime. This contradicts Theorem 1.2 on page 7. 


The developments in this section allow us to now define gcd and lcm concepts for ideals 
that mimic those for rational integers. 


Definition 1.26 — A gcd and lcm for Ideals 
If D is a Dedekind domain, and I,J are D-ideals, then 


gcd(I, J) =14+ J, and lem(I, J) =IN J. 


If ged(/, J) = D, then I and J are said to be relatively prime. 
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Remark 1.14 The notion of relative primality given in Definition 1.26 is the direct ana- 
logue for rational integers since D = (1p) is a principal ideal. This is of course what we 
mean in Z, since such a pair of integers can have no common divisors. Let us look at this 
directly. 

If I, J are relatively prime, then 


gcd(I, J) =I1+J=D. 
If a D-ideal H divides both J and J, then by Corollary 1.7 on page 27, J C H, J C H, so 
IT+J=DCH, 


which means that H = D. Hence, the only D-ideal that can divide both J and J is D = (1). 


The next result is the generalization of the result for rational integers proved in a course in 
elementary number theory. 


Lemma 1.3 — Product of the Ideal-Theoretic gcd and lcm 
If D is a Dedekind domain and J, J are D-ideals, then 


gcd(I, J) -lem(I, J) = (I+ J)(IN J) =I. 


Proof. By the definition of an ideal, any elements of J+ J times any element of JM J must 
be in J and J, so in JJ. Thus, 
In J)\L4+ J) Cl. 


Conversely, any element of JJ is in both J and J, so in IN J, and trivially in J+ J. Thus, 


IJ C(INJ)\(I+ J), 


from which the desired equality follows. 


The following exploits our unique factorization result to provide an analogue of the same 
result for rational integers. 


Theorem 1.19 — Prime Factorizations of gcd and lcm of Ideals 


Suppose that D is a Dedekind domain and J, J are D-ideals with prime factorizations given, 
via Theorem 1.17, by 
aj _ bj 
1=||?7, acd J=[[ 97, 
j=l j=l 


where P; are prime D-ideals with integers a;,b; > 0. Then 


gcd(I, J) = tye and Iem(J, J) “I is 
j=l 


where m,; = min(a;,b;) and M; = max(a,,b;), for each 7 = 1,. 


Proof. Since ged(I, J) = 1+ J, then 


ecd(I, J) IIe + 11% IIe IL" "+T]9 j-™MG) 


1.5. Dedekind Domains 31 


However, for each 7, one of a; — m; or b; —m, is zero, so the right-hand sum is D since the 
two summands are relatively prime. In other words, 


ged(I, J) =] P7", 
j=l 
as required. Now, by Lemma 1.3, (IN J)(I+ J) = IJ, so 


“ a5 b; . MN 5 
=] 9" =[[ Pr any) = 04+ D0), 


j=l j=l 
oe r r 

a;+b;—m; M; 

nS t= | err = | a8 


j= j=l 


and we have the complete result. 


Remark 1.15 Theorem 1.19 tells us that, when D is a Dedekind domain, lcm(J, J) is 
actually the largest ideal contained in both J and J, and gcd(J,J) is the smallest ideal 
containing both J and J. 


The following allows us to compare unique factorization of elements with that of ideals and 
show where Dedekind’s contribution comes into play. 


Definition 1.27 — Irreducible Ideals, gcds and lcms 


If D is an integral domain, then a D-ideal J is called irreducible if it satisfies the property 
that whenever a D-ideal J | I, then J=I or J=D. 


Theorem 1.20 — Irreducible = Prime in Dedekind Domains 


If D is a Dedekind domain, and J is a D ideal, then I is irreducible if and only if J is a 
prime D-ideal. 


Proof. Let I be irreducible and let J, K be D-ideals such that I | JK. Since gced(I, J) | I, 
then gcd(I, J) = I or gced(I, J) = D. If ged, J) = I, then I + J = I, which means that 


I=J=gcd(, J). 


Now suppose that I { J. Then gcd(I, J) = D, so there exist a € I and 8 € J such that 
a+ $=1p. Therefore, given an arbitrary y € K, y = ya+ 78. Since I | JK, then by 
Corollary 1.7 on page 27, JK C I, so By € I since Gy € JK. However, ay € Iso ye I. 
This shows that kK C I, so by Corollary 1.7, we have that I | Kk. Hence, by Theorem 1.7 
on page 16, J is prime. 

Conversely, suppose that I is prime. If J = HJ for some nontrivial D-ideals H and J, then 
either I|H or I|J. If I|H, there is a D-ideal L such that H = IL. Therefore, 


I=HJ=ILJ. 


By Corollary 1.9 on page 27, (1) = D=LJ. Hence, J = (1) = D, so I is irreducible. 


The following is immediate from Theorem 1.20, and is the analogue of the definition of a 
rational prime. 
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Corollary 1.10 If D is a Dedekind domain, then J is a prime D-ideal if and only if it 
satisfies the property that whenever J | I for a proper D-ideal J then I = J. 


Remark 1.16 It follows from Theorem 1.1 on page 5 and Theorem 1.2 on page 7 that the 
failure of unique factorization in an integral domain D is the failure of irreducible elements 
to be prime in D. However, since Theorem 1.20 tells us that irreducible ideals are the same 
as prime ideals in a Dedekind domain, then we have unique factorization restored at the 
ideal level via Theorem 1.17 on page 28. Thus, the magnitude of of Dedekind’s contribution 
is brought to light by this fact. 


We conclude this section with a result that is the generalization of the result for Z. The 
reader should be familiar with the basics of ring actions such as that covered in Appendix 
A, pages 326-328. 


Theorem 1.21 — Chinese Remainder Theorem for Ideals 


Let R be a commutative ring with identity and let [,,...,I, be pairwise relatively prime 
ideals in R. Then the natural map 


p:R/MNa Lj R/L xx R/T, 


is an isomorphism. 
The above statement is equivalent to saying that if 8), 2,...,8, € R, there existsa GER 


such that 6 — 6; € I; for each j = 1,2,...,r, where 6 is uniquely determined modulo 
N_11;. The latter means that 


any 7 satisfying y — 6; € I; for each such j implies B — y € Nj_1Jj. (1.21) 


Proof. Since 7(s) = 0 if and only if s € Ni_,J;, then ker(w) = (0), since the J; are pairwise 
relatively prime. It remains to show that w is a surjection. Let (3), 62,..., (0; € R. We must 
show that there is a 8 € R such that (8) = (61,...,6,). This is tantamount to saying: 
there is a 8 € R such that B — 6, € I, for each k. Since I; + J; = R for alli 4 j, then by 
induction I, + Oj;4%1; = R. Thus, for each such k, there exists an ay € Ip and rg € Oj ¢K1; 
such that 

Be =Oan+re with By—-—re € Ip and ry € I; for all j # k. 


Set 6 = D%_17j- Then 


B— Be = >_ 175 + (re — Ba) € Tes 
J#k 


as required. 


Remark 1.17 In Theorem 1.21, we may use the notation 
y = 8; (mod J;), 
to denote y — 6; € I;. Then (1.21) becomes: 


any 7 satisfying y = 6; (mod J;) for 1 <j <r implies 6 =y (mod Mj_, Jj). 
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Exercises 


1.36 


1.37. 


1.40. 


1.41. 


1.42. 


1.43. 


Let R be a Dedekind domain. If J, J are R-ideals, prove that there exists an a € I 
such that ged((a), IJ) = I. 


Let R be a Dedekind domain, and let I, J, H be R-ideals. Prove that 


I(J +H) =I13 +18. 


. Let R be a Dedekind domain and J, J nonzero R-ideals. Prove that there is an R-ideal 


H, relatively prime to J, such that HT is principal. 


. Let R be an integral domain with quotient field F’. Prove that every invertible frac- 


tional R-ideal is a finitely generated R-ideal—see Appendix A pages 323-326. 
Establish each of the following. 
(a) If RC S CT is a tower of integral domains and t € T is integral over R, then t 
is integral over S. 


(b) Let R,S be integral domains such that RC S. If s € S, then s is integral over 
R if and only if R[s] is a finitely generated R-module. 


(c) Let R,S be integral domains such that R C S. If 51, s2,...,5n € S are integral 
over R, then R[s1, s2,...,n] is a finitely generated R-module. 


(d) If s € S$ and there is an integral domain U such that R[s] C U C S with U 
a finitely generated R-module, then s is integral over R and R[s] is a finitely 
generated R-module. 


(e) If RC S C T is a tower of integral domains with S integral over R, and t € T is 
integral over S, then t is integral over R. 


(f) If RC S CT is a tower of integral domains with T integral over S and S integral 
over R, then T is integral over R. ( Transitivity of integrality.) 


Let R be an integral domain with quotient field F. Prove that every nonzero finitely- 
generated submodule I of F is a fractional R-ideal. 


Prove that in an integral domain R, the following are equivalent. 


(a) Every nonzero fractional R-ideal is invertible. 


(b) The set of all fractional R-ideals G forms a multiplicative group. 
Prove that in an integral domain R, the following are equivalent. 


(i) R is a Dedekind domain. 


(ii) Every proper R-ideal is a unique product of a finite number of prime ideals (up 
to order of the factors), and each is invertible. 


(iii) Every nonzero R-ideal is invertible. 
(iv) Every fractional R-ideal is invertible. 


(v) The set of all fractional R-ideals forms a multiplicative abelian group. 


(Hint: Use Exercises 1.89-1.42.) 
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1.44. 


1.45. 


1.46. 


1.47. 


1.48. 


1.49. 
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Suppose that R is a Dedekind domain with quotient field F' and I is an R-ideal. Also, 
we define: 
ordp(I) =a 


where a > 0 is the largest power of the prime ideal P dividing J. In other words, 
Pe | I but ??*! does not divide J. The value ord>(JZ) is called the order of I with 
respect to P. Prove the following. 


(a) For R-ideals I, J, 
ordp(IJ) = ord, (I) + ordp(J). 


(b) For R-ideals I, J, 
ordp(I + J) = min(ordp(Z), ordp(J)). 


(c) For any R-ideal I, there exists an a € F' such that ordp((a)) = ordp(JZ) for any 
prime R-ideal P | I. 


(We will have occasion to invoke this new concept when we have developed the tools to 
study reciprocity laws—see Proposition 6.8 on page 296 and the discussion following 
it.) 

Prove that every R-ideal in a Dedekind domain R can be generated by at most two 
elements. 

(Hint: Use Exercise 1.44.) 


Prove that D is a Dedekind domain if and only if D is integrally closed, every nonzero 
prime ideal is maximal, and D is Noetherian. 


With reference to Exercise 1.25 on page 14, prove that an almost Euclidean domain 
is a PID, and hence Noetherian. 

(Note that this is stronger than Exercise 1.25 since there are UFDs that are not 
PIDs—see Remark 1.11 on page 22.) 

Prove the converse of Exercise 1.47, namely that a PID is almost Euclidean. 


(Hint: Define a function ¢ on the PID such that (a) = 2” where n € N is the number 
of irreducibles into which a uniquely factors.) 


(Exercises 1.47-1.48 verify the assertion made in Exercise 1.25 wherein we noted that 
Greene [25] proved: D is almost Euclidean domain if and only if D is a PID.) 


Determine whether or not 
n 
J= {= :n,meZ,m>0,n > 0} 
2am 


is a fractional Z-ideal. 


. Let F = Q(V10) and Dy = Z[V10}]. Find the inverse of the 9 p-ideal 


I = (6,24 V10) 


—see Definition 1.25 on page 27. 
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1.6 Algebraic Numbers and Number Fields 


Only a fool learns from his own mistakes. The wise man learns from the mistakes of 
others. 


Otto von Bismark (1815-1898), German statesman 


§1.1-§1.5 built the foundation for us to introduce the fundamentals of algebraic number 
theory. This involves the generalization of the integral domain Z and its quotient field Q. 
To see how this is done, we consider the elements of Z as roots of linear monic polynomials, 
namely if a € Z, then a is a root of f(a) = x—a. Then we generalize as follows, with some 
of what follows adapted from [54]. 


Definition 1.28 — Algebraic Integers 


If a € C is a root of a monic, integral polynomial of degree d, namely a root of a polynomial 


of the form : 


f(z) = So aja! =a9tae+---+agi2t'+a%¢ Zz], 
j=0 


which is irreducible over Q, then a is called an algebraic integer of degree d. 


Example 1.25 a+b/—-1 =a + bi, where a,b € Z, with b £ 0 is an algebraic integer of 
degree 2 since it is a root of x? — 2ax + a? + b? but not a root of a linear, integral, monic 
polynomial since b £ 0. 


In Definition 1.3 on page 2 we introduced primitive roots of unity which are a distinguished 
type of algebraic integer. Another special type of algebraic integer is given by the following. 


Example 1.26 Numbers of the form 2 + z1¢n + 2262 +++ + 2n-107', for z; € Z are 
called cyclotomic integers of order n. 


Now we develop the generalization of the rational number field as a quotient field of a special 
ring for which this sets the stage. 


Definition 1.29 — Algebraic Numbers and Number Fields 


An algebraic number, a, of degree d € N is a root of a monic polynomial in Q[2] of degree 
d and not the root of any polynomial in Q[z] of degree less than d. In other words, an 
algebraic number is the root of an irreducible polynomial of degree d over Q. Denote the 
subfield of C consisting of all algebraic numbers by N, and the set of all algebraic integers 
in N by A. An algebraic number field, or simply number field, is of the form 


F = Q(a1,a2,...,Q@n) C C with n € N where a; EN for 7 = 1, 2;...;7: 


An algebraic number of degree d € N over a number field F is the root of an irreducible 
polynomial of degree d over F. 


Remark 1.18 If F' isa simple extension, namely of the form Q(a), for an algebraic number 
a, then we may consider this as a vector space over Q, in which case we may say that Q(a) 
has dimension d over Q having basis {1,a,...,a7~1}. (See Theorem A.4 on page 325. Also, 
see Exercise 1.51 on page 43 to see that all number fields are indeed simple.) 
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By Definition 1.29, Q is the smallest algebraic number field since it is of dimension 1 over 
itself, and the simple field extension Q(q@) is the smallest subfield of C containing both Q 
and a. 


We now demonstrate that A, as one would expect, has the proper structure in N, which 
will lead us to a canonical subring of algebraic number fields. If necessary, the reader may 
review the basics on modules beginning on page 323 in Appendix A. 


Theorem 1.22 — The Ring of All Algebraic Integers 
A is a subring of N. 


Proof. It suffices to prove that if a,8 € A, then botha+6¢€A anda € A. To this end 
we need the following. 


Claim 1.3 If a € A, then Z[a] = { f(a) : f(a) € Z[a]} is a finitely generated Z-module. 


Since a € A, then there exist a; € Z for 7 = 0,1,...,d—1 for some d > 1 such that 


até —dq_1ae! —-+++-—aya—ap = 0. 
Therefore, 
a? = ag_1a*—! +. ag_20* 7? +--+» +a1a+ a9 € Zo* 1 +---+Zat+Z, 
and 
attt — Giga" + Gao + b--+ +4107 + aga Zot +Zat1+.-.-4+Za?+Za 


Cleo 4 Ga? beet Da $Z. 
Continuing in this fashion we conclude, inductively, that 
ae Za) 4 Zot? eset Zor+Z, 
for any c > d. However, clearly, 
a CLa 4 4 ot +e bast Ze 
for c=1,2,--- ,d—1, so 
a? € Zat 14 Zot? +---+Za+Z, 


for any c > 0. Hence, Z[a] is a finitely generated Z-module. This completes Claim 1.3. 


By Claim 1.3, both Z[a] and Z[6] are finitely generated. Suppose that a1, a2,...,a% are 
generators of Z[a] and 6,,b2,...,bg are generators of Z[6]. Then Z[a,G] is the additive 
group generated by the a,b; for 1 <i<kand1<j<&. Thus, Z[a,(] is finitely generated. 
Since a + 8,a6 € Zla,8] C A we have secured the theorem. 


Given an algebraic number field F, FMA is a ring in F’,, by Exercise 1.29 on page 19. This 
leads to the following. 


Definition 1.30 — Rings of Integers 


If F is an algebraic number field, then FMA is called the ring of (algebraic) integers of F, 
denoted by Op. 


1.6. Algebraic Numbers and Number Fields 37 


With Definition 1.30 in hand, we may now establish a simple consequence of Theorem 1.22. 


Corollary 1.11 The ring of integers of Q is Z, namely Dg = QNA=Z. 


Proof. If a € ANQ, then a = a/b where a,b € Z and gced(a,b) = 1, with b 4 0. Since 


a € A, there exists an 
d 


f(x) =a0+ 55 aja? € Za] 


j=l 


with aq = 1, such that f(a) = 0. If d= 1, then we are done, since ap + a € Z and ap € Z. 
If d > 1, then 


d 
ao + So ajai EZ, 
j=l 
so 
: oe : ajaIbt—J Z 
dae => bd 
j=l j=l 


Therefore, b4 divides 7°_, aja/b4~J. Since d > 1, b divides 4} aja/b*~4, so b | at. But 
gcd(a,b) =1,sob=1landaeZ. 


Corollary 1.12 If F is an algebraic number field, then QN Op = Z. 


Proof. Since DOr C A, then by Corollary 1.11, QNOr C QNA = Z. But clearly Z C QNY fp, 
so we have equality. 


Remark 1.19 In order to state the next result, we require a few comments on the notion 
of finite generation. By Definition 1.29 and Claim 1.3 in the proof of Theorem 1.22, we 
know that for any number field F’, Or is finitely generated as a Z-module. Thus, any 9 r- 
ideal J will have a representation as I = (a1, Q2,...,aa) with a; € Or for 7 =1,2,...,d, 
and we say that I is finitely generated. In the instance where d = 1, we are in the case of 
Definition 1.13 on page 15, namely a principal ideal. Also, see Remark 1.9 on page 15. 


Corollary 1.13 If F' is a number field, then Or is a Noetherian domain. 


Proof. This follows from Remark 1.19 above and Lemma 1.2 on page 20. 


In Definition 1.22 on page 25, we defined integrality over a domain. Now we extend this 
notion to algebraic numbers and number fields. 


Definition 1.31 — Elements Algebraic Over a Domain 


If RC S where R and S are integral domains, then if R is a field and a is integral over R, 
then a is said to be algebraic over R. Also, if every nonconstant polynomial f(x) € R[x] 
has a root in R, then R is said to be algebraically closed. Moreover, any extension field that 
is algebraic over R and is algebraically closed is called an algebraic closure of R, and it may 
be shown that an algebraic closure is unique up to isomorphism. 


Remark 1.20 In view of Definition 1.29 on page 35, and Definition 1.31 above, we may 
now restate the notion of an algebraic number as a complex number that is algebraic over 
Q. Moreover, in view of Definition 1.28 on page 35 and Definition 1.31, we see that an 
algebraic integer is a complex number that is integral over Z. 
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Given an element a that is algebraic over a number field F’, Definition 1.31 tells us that 
there is a monic polynomial mq,r(x) € F [a] with ma,r(a) = 0. We may assume that ma,r 
has minimal degree. Hence, mq,r must be irreducible, since otherwise, a would be the root 
of a polynomial of lower degree. Thus chosen, ma,r(x) is called the minimal polynomial of 
a over F. It turns out this polynomial is also unique. 


Theorem 1.23 — Minimal Polynomials Are Unique 


A number a € C is an algebraic number of degree d € N over a number field F if and only 
if a is the root of an unique irreducible monic polynomial 


Ma,F(@) € Fla]. 
Any h(x) € F[a] such that h(a) = 0 must be divisible by mg, r(x) in Fa]. 


Proof. If a is an algebraic number of degree d over F’, then by Definition 1.29, we may let 
f(x) € F[z] be a monic polynomial of minimal degree with f(a) = 0, and let h(x) € F[z] 
be any other monic polynomial of minimal degree with h(a) = 0. Then by the Euclidean 
algorithm for polynomials (see Theorem A.13 on page 333) there exist q(x),r(a) € F[z] 
such that 


h(x) = q(x) f(x) + r(x), where 0 < deg(r) < deg(f) or r(x) = 0, the zero polynomial. 


However f(a) = 0 so h(a) = 0 = f(a), and r(a) = 0, contradicting the minimality of f 
unless r(x) = 0 for all «. Hence, f(x) | h(a). The same argument can be used to show that 
h(x) | f(a). Hence, h(a) = cf(a) for some c € F. However, f and h are monic, so c = 1 
and h = f. This proves that f(x) = mq,r(«) is the unique monic polynomial of a over F. 
The converse of the first statement follows a fortiori. 

To prove the second statement, assume that h(x) € Fa] such that h(a) = 0 and use the 
Euclidean algorithm for polynomials as above to conclude that ma,r(x) | h(a) by letting 
Ma,F(«) = f(x) in the above argument. 


Corollary 1.14 An irreducible polynomial over an algebraic number field has no repeated 
roots in C. In particular, all the roots of mq,r() are distinct. 


Proof. If F' is a number field and f(a) € F[2] is irreducible with a repeated root a, then 
f(x) = e(x — a)*g(2), 


for some c € F and g(x) € C{z]. By Theorem 1.23, ma,r(2x) | f(x) so f(x) = ama,r(x) for 
some a € F, since f is irreducible. However, f’(x) = 2c(a — a)g(x) + c(a — a)?g' (x), where 
f’ is the derivative of f. Hence, f’(a) = 0, so by Theorem 1.23, again mo,r(x) | f' (a), 


contradicting the minimality of ma,r(«) since deg(f’) < deg(f). 


Corollary 1.15 If a € A, then my,a(x) € Z[z]. 


Proof. This follows from Definition 1.28 on page 35 and Theorem 1.23. 


Now our goal is to demonstrate that algebraic integers are sufficient to characterize algebraic 
number fields. First we need the following crucial result. 


Lemma 1.4 — Algebraic Numbers as Quotients of Integers 


Every algebraic number is of the form a/¢ where a is an algebraic integer and ¢ € Z is 
nonzero. 
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Proof. By Definition 1.29, if y is an algebraic number, there exist a; € Q for 7 = 
0,1,2,...,d—1 such that y is a root of 


f(x) =ao + a,2 4 ayn? Lge age oe", 
Since 
ao tary tagy? +++: +ag-197 1 +77 =0 
we may form the least common multiple, £, of the denominators of the a; for 7 = 0,1,...,d. 


Then multiplying through by @, 
(ly)? + (€ag_1)(Ly)2* ++ + (02-101) (Cy) + C409 = 0. 


Thus ¢y is the root of a monic integral polynomial, so ¢y is an algebraic integer, say, a. 
Hence, y = a/é, with a € A and @€ Z. 


Corollary 1.16 — Quotient Fields of Number Rings 
If F is a number field, then the quotient field of Op is F. 


Proof. Let K = {aB-!:a,8 € Or, 6 # 0} be the quotient field of Or. Suppose that 
y=aB-!€K. Since Dp C F, then y € F, so K C F. Now if y € F, then by Lemma 1.4, 
 y =a/é where a € A and ¢ € Z. However, since a = yf € FN A = Of by Definition 1.30 
on page 36, then a € Op C F,so K C F. Hence, K = F. 


Theorem 1.24 — The Primitive Element Theorem for Number Fields 
If F is an algebraic number field, then there is an algebraic integer a such that F' = Q(a). 
Additionally, if 6 is algebraic over F' with minimal polynomial mg,r(x), then 


|F(8) : F| = deg(mg,r). 


Proof. By Exercise 1.51 on page 43, F' = Q(y) for some algebraic number 7, and by Lemma 
1.4, Q(y) = Q(a/l) = Q(a) or some a € A. 


The second statement will follow if we can show that every element 6 € F() is uniquely 


represented in the form 
d-1 


b=) 7 a;8' € FIA], 
j=0 

where deg(mp,j(z)) = d. Since 5 = f(8)/9(8) with f(x), g(a) € Fla] and (8) 4 0, 
then by Theorem 1.23 on the facing page, mg,r(x) does not divides g(x). Therefore, 
gced(g(x),mg.r(x)) = 1, so by Theorem A.13, there exist s(x),t(x) € F[z] such that 
5(t)g(2e)+4(x)rg,n(t) = 1. Since mp, p(B) = 0 then s(8) = 1/9(8). Thus, 6 = f(8)/9(8) = 
f(B)s(8). Let h(x) = f(x)s(x) € Fla]. By Theorem A.13 again, there exist q(x), r(x) € 
F [a] such that h(x) = q(x)mg,r(x) + r(x) such that deg(r) < deg(mg,r(x)) or r(x) = 0. 
However, 


6 = f(8)s(B) = h(B) = a(8)mg,r(8) + r(8) = r(8). 


It remains to show that r(x) is unique. Suppose that v(a) € F'[z] such that deg(v) < d—1 
and 6 = v(8). Thus, r(6) — v(8) = 0 so 8 is a root of r(x) — v(x) € F[a] contradicting the 
minimality of mg,r(x), whence r(x) — u(x) = 0, the zero polynomial, namely r(x) = v(x) 
as required to secure the second statement. 
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Example 1.27 Let FE = Q(vV2,i), where i = ¢, = V—1 is a primitive fourth root of unity. 
Then by Exercise 1.53 on page 43, 


Q(i, V2) =Q (=) and Cg = ae 


Example 1.28 If F = Q(t) and a = ¢ is a primitive eight root of unity, then 


Mea,F(£) = vi 
is the minimal polynomial of a@ over F. Moreover, the minimal polynomial of @ over Q is 
given by 
8 
ze —1 4 
i ame a +1, 


Mo,Q(2) = = 


which is an example of the following type of distinguished polynomial. 


Definition 1.32 — Cyclotomic Polynomials 


If n € N, then the n*” cyclotomic polynomial is given by 


6,(2)= [] (#-@), 


gcd(n,j)=1 
I<j<n 


where ¢,, is given by Definition 1.3 on page 2. The degree of® ,,(x) is d(n), where (n) is 
the Euler totient—see Definition A.22 on page 342. 


Remark 1.21 The reader may think of the term cyclotomic as “circle dividing,” since 
the n*” roots of unity divide the unit circle into n equal arcs. The cyclotomic polynomial 
also played a role in Gauss’s theory of constructible regular polygons—see [20, §365-§366, 
pp. 458-460]. 


Note that since the roots of the n“” cyclotomic polynomial are precisely the primitive n*” 
y' y 


roots of unity, then the degree of® ,(a) is necessarily ¢(n). We now demonstrate the 
irreducibility of the cyclotomic polynomial. 


Theorem 1.25 — Irreducibility of the Cyclotomic Polynomial 
For n EN, ®,(2) = m¢,,o(x), so® p(x) is irreducible in Z[z]. 


Proof. We may let 
®,,(x) = m¢,,,0(x)g(x) for some g(x) € Z[z] 


by Theorem 1.23 on page 38. 
Claim 1.4 m¢,0(¢2) =0 for any prime p{ n. 


If me,,o(G2) A 0, then g(¢2) = 0, so ¢, is a root of g(x’). By Theorem 1.23 again, 
g(x?) = me,o(x)h(a) for some h(x) € Z[a]. Let 


fla) = aja € Ze] 
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have image 
F(x) = So a2! 
j 
under the natural map 
Z|] + (Z/pZ)|2). 
Thus, 
G(x?) = Me, a(x)h(2). 
g’(x) since char(Z/pZ) = p. Therefore, 0 = g(¢?) = (G(¢n))? = G(Gn)- 
— 1), then 
a" —1=®),(x)k(x) = me,,o()g(x)k(@), 
for some k(a) € Z[x]. Therefore, in Z/pZ[z], 


However, g(x?) = 
Since® ,, (x) | (2? 


a” —L=2%—1=™¢, 0(x)9(z)k(2). 


Since 9 and 7™7,,q@ have a common root ¢,, then «” — 1 has a repeated root. However, 
this is impossible by irreducibility criteria for polynomials over finite fields, since p { n, (see 
Corollary A.8 on page 332 where we see: 


x” —1 is irreducible if and only if ged(x” — 1, a?’ x) =1for1<i< |n/2}). 


We have established Claim 1.4, namely that ¢? is a root of m¢,,.@(x) for any prime p { n. 


Repeated application of the above argument shows that y? is a root of me, g(a) whenever y 
is a root. Hence, ¢? is a root of me, 9(2) for all 7 relatively prime to n such that 1 <j <n. 
Thus, deg(m¢,,9) > ¢(n). However, me, o(2) | ®,,(x) so 


M¢,,,Q(Z) = ®n(z), 


as required. 


Corollary 1.17 For n €N, |Q(¢n) : Q| = d(n). 


Proof. By Theorems 1.24—1.25, in view of Definition 1.32, the result follows. 


At this juncture, we look at general properties of units in rings of integers, in keeping with 
one of the themes of this section. 


Proposition 1.1 Let a € A. Then the following are equivalent. 
(a) @ is a unit. 
(b) a] lin A. 
(c) If F = Q(a), then mg,r(0) = +1. 


Proof. The equivalence of (a) and (b) comes from Definition 1.1 on page 1. Now assume 
that a is a unit. Then, by Exercise 1.52 on page 43, 


d 
Mq,F(0) = (-1)* |] a; = 41 


if and only if a € Up, so (a) and (c) are equivalent. 


One of our main goals is the following result that leads us toward a unique factorization 
theory for ideals in rings of algebraic integers. In order to state it we need the following 
result which is motivated by Example 1.18 on page 15. 
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Lemma 1.5 — QO ,-Ideals Intersecting Z 


If F is a number field and J is a nonzero D--ideal, then IM Z contains a nonzero element 
of Z. 


Proof. Let aw € I where a £ 0 and consider mq,9(z) = a9 +a1%+-+:+ag_127-1+27, where 
a; € Z for all 7 = 0,1,...,d—1 by Corollary 1.15 on page 38. If d= 1, then ag = —a £ 0, 
and if d > 1, then ap £ 0 since my,Q(2) is irreducible in Q[z] by Corollary 1.15. Hence, 

1 


ao = —a1e == 4g 40" =o et 


as required. 


Theorem 1.26 — Rings of Integers are Dedekind Domains 


If F is an algebraic number field, then Or is a Dedekind domain. 


Proof. By Corollary 1.13 on page 37 (in view of the comment on condition (A) in Re- 
mark 1.12 on page 26), condition (A) of Definition 1.23 on page 25 is satisfied. 


Now we show condition (B) holds. Assume that there is a prime D-ideal P ¥ (0) that is 
not maximal. Therefore, the set S, of all proper O;-ideals that strictly contain P, must 
be nonempty. By Corollary 1.4 on page 21, there is a maximal ideal M € S such that 
PCM COrf. By Theorem 1.10 on page 18, M is a prime Dv-ideal. By Lemma 1.5, there 
exists a nonzero a € PZ. By Exercise 1.29 on page 19, PM Z is a Z-ideal. 


Suppose that ab € PNZ, where a,b € Z. Since P is a prime OD r-ideal, then a € P or b E P so 
ae PNZ or be PNZ, which means that PN Z is a prime Z-ideal. If p € PNZ is a rational 
prime, then (p) C PZ and (p) is a maximal Z-ideal by Theorem 1.11 on page 18 since 
Z/(p) is a field by Example 1.21 on page 19. Hence, since PN Z F Z, we have (p) = PN Z. 
However, (p) = ?NZC MNZCZ, where 1 ¢ M, so (p) = PNZ= MZ. Since M € 8, 
then P ~£ M, so there exists an a € M such that a ¢ P. Consider 


Mo.o(x) = x4 + ag_yxt-! +--+ ax + ag € Z[z] for some d EN. 
,Q 


Then mo,9(a) = 0 € P. Now define @ € N to be the least value for which there exist integers 
b; such that 
X =of +by_jae 1 +--+ + ba + bp EP, (1.22) 


for 7 = 0,1,---,@—1. Since a € M, then by properties of ideals, 
Y = alae} + by_yae 7? +--+ +01) € M. (1.23) 


Since P C M, then by (1.22)-(1.23), X —Y = bp € M, sob) €E MNZ=PNZ. If l=1, 
then a € P, a contradiction, so £ > 1. Thus, by (1.22), 


af + baa +---+b,a+ bo — bo = ala‘! + be_1a*-? +--++b1) EP. 


However, since P is prime and a ¢ ?, then a*~! + be_ja*-? +--+ +b; € P, contradicting 
the minimality of £ > 1. We have shown § = @, which establishes that condition (B) of 
Definition 1.23 holds. 

For condition (C), we note that since F is the quotient field of Of by Corollary 1.16 on 
page 39, then any a € F is integral over Op. Since Op is integral over Z, then by part (e) 
of Exercise 1.40 on page 33, a is integral over Z. In other words, a is an algebraic integer 
in F, namely a € Op. Hence, O- is integrally closed and we have part (C) that establishes 
the entire result. 
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Exercises 
1.51. Prove that if an algebraic number field F' is of the form 
F = Q(q1, Q2,..-,Qn) 
for n € N where a; for 7 = 1,2,...,m are algebraic numbers, then there is an alge- 
braic number ¥ such that F = Q(y). (Hence, all algebraic number fields are simple 
extensions of Q.) 
(Hint: It suffices to prove this for n = 2 with ay =a and az = B. Let 
da 
Ma,Q(@) = | | (x — 95), 
j=l 
where the a; are the conjugates of a over Q, and let 
dg 
ma,o(x) = |} (@— 65), 
j=l 
where the B; are the conjugates of 8; = 8 over Q. Select q € Q with 
q # (a — ax) /(8; — 8) 
for any k =1,2,...,dq and any j =1,2,...,dg Also, let 
y=atap 
and 
f(@) = Ma,a(7 — 92): 
Prove that 8 is the only common root of f(x) and mg.g(x). Show that this implies 
Q(a,8) C Q(y). The reverse inclusion is clear.) 

1.52. Let F be an algebraic number field. Prove that if a € Up, then a; € Up for all 
j =1,2,...,d, where mq.r(x) = z4 + ag_1z4-! +--+ +12 + a9, for some d € N is 
the minimal polynomial of a over F, and a; are the roots of ma r(x). Conclude that 
if F is an algebraic number field, then a € Up if and only if Mean aj; = +1. 

1.53. Referring to Example 1.27 on page 40, prove that 

1+2 
Qi, v2 =a ) 
(i, V2) fa 
and that if ¢g is a primitive eighth root of unity, then it is an odd power of (1+1)/V2. 
1.54. Prove that 


2” -1=]] (a), 


d|n 


where® q(x) is the cyclotomic polynomial given in Definition 1.32 on page 40. 
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1.7 Quadratic Fields 


It’s not that I’m so smart; it’s just that I stay with the problem longer. 
Albert Einstein (1879-1955), German-born theoretical physicist 


In this section we use the tools developed in this chapter and apply them to quadratic fields. 
This is a precursor to the general number field development later in the text and gives an 
overview of the least nontrivial case of a number field extension of Q. 


First we establish the rings of integers for quadratic fields. This extends our discussion begun 
in Application 1.2 on page 3. Then, we show that a given quadratic field is determined by 
a unique squarefree integer. We note that if f(#) = 2? + ax + b € Q{a], is irreducible, and 
a € Cis a root of f(x), then the smallest subfield of C containing both Q and a is given 
by adjoining a to Q, denoted by Q(a), so 


Q(a) = {x+ ya: 2z,y € Q}. 


This is what we call a quadratic field, which we loosely discussed in Application 1.2 on 
page 3. 
Quadratic polynomials with the same squarefree part of the discriminant give rise to the 
same quadratic field. To see this suppose that: 

f(x) =a? + ba +c and g(x) = 2? + bya +c, € Q[a] are irreducible, 


A=0b? —4c=m?D 


v 


and 
Ay = b? = 4c, = m2D, 


where m,m, € Z and D is squarefree. 


Then 
Q(VA) = Q(Vm2D) = Q(mvVD) = Q(VD) 


= Q(miVD) =Q ( m2) = Q(/A}). 
Thus, we need the following to clarify the situation on uniqueness of quadratic fields. 


Theorem 1.27 — Quadratic Fields Uniquely Determined 
If F is a quadratic field, there exists a unique squarefree integer D such that F = Q(VD). 


Proof. Suppose that F = Q(a), where a is a root of the irreducible polynomial x? + bx +c. 
By the well-known quadratic formula a € {a1, 2}, where 


—b+ Vb? — 4e —b— Vb? —4e 
5 . 


, and ag = 5 


Since a, = —a2 — b with b € Q, then Q(a,) = Q(az) = Q(a). However, 


Q(a1) =O (4 a %) = Q( VP 40). 


ag = 


Let a = b? — 4c = e/f € Q. Then a ¥ d? for any d € Q since x? + bx + ¢ is irreducible 
in Q[a]. Without loss of generality we may assume that gcd(e, f) = 1 and f is positive. 
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Let ef = n?D, where D is the squarefree part of ef. Hence, D 4 1, and arguing as in 
the preamble to this theorem, Q(VD) = Q(/a), observing that Q(,/e/f) = Q(Vef). This 


shows existence. It remains to prove uniqueness. 


If D, is a squarefree integer such that Q(VD) = Q(VD1), then VD = u+ vWD, with 
u,v € Q. By squaring, rearranging, and assuming that uv 4 0, we get 


D — wu? — Dv? 
VD, = —,.— €Q@ 
2uu 
which contradicts that D, is squarefree. Thus, wv = 0. If v = 0, then VD € Q, contradicting 
the squarefreeness of D. Therefore, u = 0 and D = v7Dj, but again, D is squarefree, so 
v? = 1, which yields that D = D,. 
Now we are in a position to determine the ring of integers of an arbitrary quadratic field, 
which we motivated in Application 1.2 on page 3. 


Theorem 1.28 — Rings of Integers in Quadratic Fields 


Let F be a quadratic field and let D be the unique squarefree integer such that F = Q(vVD). 
Then 


feats Z ean if D = 1(mod 4), 
pe 

Z|VD] if D # 1(mod 4). 
Proof. Let 


_j 2 if D=1(mod 4), 
7) 1 if D#1(mod 4). 


Then since (1+ VD)/q is a root of x? — 2x/a + (1 — D)/o? we have 
—1+VD 
Z+Z (oe? | Cop 
o 


It remains to prove the reverse inclusion. 


Let a € Op C F. Then a =a+bVD where a,b € Q. We may assume that b ¥ 0, since 
otherwise we are done, given that 


zcnrn(S te), 
(on 


Since Op is a ring, then a! = (a— bVD), a+ a! = 2a, and aa’ = a? — Db? are all in 
Or. However, the latter two elements are also in Q, and by Corollary 1.12 on page 37, 
Dr NQ=Z, so 


Ja,a° = Db’ € Z. (1.24) 
Case 1.1 a ¢ Z. 


We must have a = (2c + 1)/2 for some c € Z. Therefore, by (1.24), 4(a? — Db?) € Z, which 
implies 4Db? € Z. However, since D is squarefree, then 2b € Z. (To see this, observe that 
if 2b = g/f where g, f € Z with gcd(f,g) = 1, and f > 1 is odd, then 4Dg? = f7h for some 
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héZ. Thus, since gcd(4g, f) = 1, f? | D contracting its squarefreeness.) If b € Z then, by 
(1.24), a € Z, contradicting that a = (2c+1)/2. Therefore, b = (2k + 1)/2 for some k € Z. 
Thus, 


2 1)? D(2k+1)? 1-—D 

a2 — Dee = et ) ( a5 ) =—@t+e (ke +k)D 4 ; 

4 4 4 
which implies (D — 1)/4 = c? +c— (k? +k)D — a? + Db? € Z, hence, D = 1(mod 4) and: 
2c+1 (2k+1)VD (2k +1)(1+ VD) 
2 2 2 
oe (14) ee z(t?) 
o 


Case 1.2 a€ Z. 


In this instance, by (1.24), Db? € Z, and arguing as above, since D is squarefree, b € Z. 
Hence, 


66:9 DEZSZ/D=24 z(otev?), 
oO 


which completes the reverse inclusion that secures the theorem. 


Definition 1.33 — Quadratic Field Discriminants 
If D is the unique squarefree integer such that F = Q(VD) is a quadratic field, then the 
discriminant of F' is given by 
fete D_ if D=1(mod 4), 
ve) 4D if D#1(mod 4). 


Remark 1.22 Definition 1.33 follows from the fact that the minimal polynomial of F' is 
x? —x+(1—D)/4 if D = 1(mod 4), and is 2? — D if D # 1(mod 4). In §2.3, we will 
study general number field discriminants and prove the fact, implicit in Definition 1.33, 
namelyA ¢ = 0,1(mod 4), holds for any number field F’. This is known as Stickelberger’s 
Theorem—see Biography 1.4 on page 54 and Theorem 2.10 on page 77. 


Example 1.29 Suppose we have an irreducible quadratic polynomial 
f(x) = ax? + be +c € Q[z]. 


Then A= 0? —4ac is the discriminant not only of f(a), but also the quadratic field Q(WA). 
By the quadratic formula, the roots of f(x) are given, since a 4 0, by 


—b+VA , —b-VA 
a= ———,, and a’ = ——__,, 
2a 2a 


where a’ is called the algebraic conjugate of a. By Exercise 1.1 on page 6, Q(a) = Q(VA), 
which we know is a simplest nontrivial number field, a quadratic field over Q. 


The reader will note that some easily verified properties of conjugates are given as follows. 
(a) (a8)! = a’ 6". 
(b) (at BY =a! + 6 
(c) (a/6)' = a'/8', where a/8 = 6 € Q(VA). 
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Remark 1.23 If D < 0 in Theorem 1.28 on page 45, F is called a complex (or imaginary) 
quadratic field, and if D > 0, F is called a real quadratic field. Also, the group of units in 
a quadratic field forms an abelian group. For real quadratic fields we will learn about this 
group later, since it is more complicated than the complex case which we now tackle. The 
reader will recall the notion of groups and notation for a cyclic group, (g), generated by an 
element g—see Definition A.3 on page 320. 


Theorem 1.29 — Units in Complex Quadratic Fields 
If F = Q(VD) is a complex quadratic field, then 


(Ge) = CHEF) if D = 3, 


Ur =Yor =) (4) =(v-1) if D=-1, 

(C2) = (-1) otherwise. 

Proof. By Theorem 1.28 we may write u = a+bV/D € Up,, with ca,ob € Z where a is 
defined as in the proof of Theorem 1.28. Hence, if D # 1(mod 4), then a? — b?D = 1, for 
some a,b € Z. If D < —1, then a?—b?D > 1 for b #0. Thus, b = 0 for D#1(mod 4) with 
D < —1. In other words, 


Us, = (-1) = (@) if D=2,3 (mod 4) and D < -1. 


Now we assume that D = 1(mod 4), so a?— Db? = 4 for a,b € Z. If D < —4, then for b 4 0, 
a? — Db? > 4, a contradiction. Hence, for D = 1(mod 4) and D < —4, Up, = (C2). It 
remains to consider the cases D = —1, —3. If D = —1, then by Theorem 1.28, Or = Z+Z(i], 
and a+bi is a unit in Of if and only if a?-+b? = 1. The solutions are (a,b) € {(0+1), (+1, 0)}. 
In other words, 

Ui = (+1, £1}. 


If D = —3, then a? + 3b? = 4, so either a = b = 1, or b= 0 and a = 2. Hence, the units are 
1, (14 V—3)/2, and (—1 + /—3)/2. However, 1 = C8 and we have: —1 = @, 


(1 + V—3)/2 = 6, 


(1— vV—3)/2 = @, 
(-1+ V—3)/2= @, 


and 
(—1- V—3)/2 = @. 
Hence, 
Uaiejam a (G6), 


as required. 


Now we look at multiplication of ideals in quadratic fields. If the reader is in need of a 
reminder about the basics involved in modules and their transition to ideals in the rings of 
integers in quadratic fields, then see Exercises 1.55—-1.58. In any case, see Exercise 1.62 on 
page 54. 
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Multiplication Formulas for Ideals in Quadratic Fields 


Suppose that 


F =Q(vD) 


is a quadratic number field, and 0 y is its ring of integers-see Theorem 1.28 on 
page 45. LetA fF be the field discriminant given in Definition 1.33 on page 46, 
and for 7 = 1,2 with a; € N,b; € Z, let 


T; = (a;, (6; + VAr)/2), 
be Op-ideals. Then 


r 


rts (on YER) 


where 
ay,az 
a3 = ’ 
g 
by, + be 
g= gcd a1, a2, 2 ’ 
and 


1 
b3 = F (Sash, + paybe + 5 (bi b2 + Ar) (mod 2as), 


where 6, 4, and v are determined by 


daz + Hay, + 5 (hi + bz) = 9. 


Note the above formulas are intended for our context, namely the ring of integers of a 
quadratic field O-, called the maximal order. In an order contained in Op that is not 
maximal, the above does not work unless we restrict to invertible ideals. For the details 
on, and background for, orders in general, see either [49, §1.5] or [50, §3.5]. Also, see 
Definition 1.25 on page 27 and Exercise 1.43 on page 33. 


Example 1.30 ConsiderA ¢ = 40, with 
I, = (3,1+ 10) and Ig = (3,-1+4 V10), 


so in the notation of the above description of formulas for multiplication, we have 


a, = ag = 3,b, = 2 = —bo, 9g = 3,6 =0 =v, u=1,b3 = 1, and a3 =1, 


= Llp = (8,1 -+/10)(3, -1 + V10) = (3). (1.25) 


Hence, the product of J; and Jy is the principal ideal (3) in Z[V10] = Op, and by Theo- 
rem 1.8 on page 16, (3) is not a prime ideal in Op since (3) divides I, Jz but does not divide 
either factor. To see this, note that if 


(3) | (3,41 + V10), 


then by Lemma 1.1 on page 17, 


(3,41 +10) € (3), 
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which is impossible since it is easy to show that +1+/10 ¢ (3). Moreover, by Exercise 1.61 
on page 54, I; and I, are prime O r-deals. 


Example 1.30 motivates a study of prime decomposition of ideals in quadratic fields. For 
instance, (1.25) is the decomposition of the ideal (3) in Z[V/10] = Or into the product of the 
two prime ideals J; and Jj. In what follows, we have a complete description. The notation 
(D/p) in the following denotes the Legendre symbol—see Definition A.23 on page 342. Also, 
the symbol N(P) will denote the norm of a quadratic ideal as defined in Exercise 1.58 on 
page 54. 


Theorem 1.30 — Prime Decomposition in Quadratic Fields 


If Op is the ring of integers of a quadratic field F = Q(VD), and p € Z is prime, then the 
following holds. 


PiPy if p> 2,(D/p) =1, or p=2,D=1 (mod 8), 
where P;, are distinct prime 0 p-ideals for 7 = 1,2 
and N(P;) =p, 
(p) =pOr=< P if p > 2,(D/p) = -1, or p= 2,D=5 (mod 8), 
where ? is a prime 9D -ideal with N(P) = p?, 
p2 if p > 2,p|D, or p=2,D =2,3 (mod 4), 
where P is a prime 9 r-ideal with N(P) = p. 


Proof. For the sake of simplicity of elucidation in the following Cases 1.3-1.5, we present 
only the instance where 9p = Z[VD] since the proof for Dp = Z[(1 + VD) /2] is similar. 


Case 1.3 (D/p) =1 for p> 2. 
The Legendre symbol equality tells us that there exists a b € Z such that 
b? =D (mod p). 
Also, since p{ D, then p{ b. Let 
Py =(p,b+ VD) and Pz = (p,—b + VD). 


If Py = Po, then 
2b=b+VD—(-b+ VD) € Py, 


SO p | 2b by the minimality of p as demonstrated in Exercises 1.56—1.58, namely 
2bE€ Py NZ = (p). 


Thus, P; and P2 are distinct DO r-ideals. By the multiplication formulas given on page 48, 
we have, in the notation of those formulas, ag; = 1 and g = p, so 


Pi Po = (p). 
Case 1.4 (D/p) = —1 for p> 2. 
Let af € (p), where 


a=ayt+ b, VD, =agt+ boVD = Z[VD]. 
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Suppose that 6 ¢ (p). We have 


a8 = ayaz + b1b2D + (ab; + a1b2)VD = p(x + yVD), 


for some x,y € Z. Therefore, 
ayag + bybyD = px, (1.26) 


and 
ab + aybe = py. (1.27) 


If b; = 0, then by (1.26), p | a1a9. If p | a,, then a = a, € (p), so by Definition 1.14 on 
page 15, (p) is an Dp-prime ideal. If p | a2, then p{ by since 6 ¢ (p), so by (1.27) p | a, and 
we again have that a € (p). Hence, we may assume that b; 4 0. Similarly, we may assume 
that ay x 0. 


Multiplying (1.27) by a; and subtracting b; times (1.26), we get 
bo(a? — b7D) = p(ayy — biz). 


If p | (af — b?D), then there exists a z € Z such that a? — b?D = pz. Therefore, 
1— 7 1— % 


(2-2) (2) -@)-s 


a contradiction. Hence, p | be. By (1.27), this means that p | a2b,. If p | a2, then 


p | (a2 + b2VD), 80 8 € (p), 
a contradiction to our initial assumption. Thus, p | by, so 
p| (a1 + b, VD), which means that a € (p). 


Case 1.5 p> 2 and p | D. 


Let P = (p, VD). Then by the multiplication formulas on page 48, with aj = 1 and g = p 
in the notation there, P? = (p). This completes Case 1.5. 


It remains to consider the three cases for p = 2. 
Case 1.6 p=2 and D=1(mod 8). 


Let 


P= (35 (i+ vD)/2) and Py = (2, (-1+ vD)/2) 
Then by the multiplication formulas as used above with a3 = 1 and g = 2, we have 
P,P. = (2). 


If Py = Po, then = 
(1+ VD)/2+(-1+VD)/2=VDeE?, 


which is not possible. Thus, P, and P2 are distinct. This is Case 1.6. 


Case 1.7 p= 2 and D=5(mod 8). 
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Let af € (2), where 
a = (a, +6,VD)/2, B = (a2 + boVD)/2 € Z[(1 + VD)/2], 


with a; and b; of the same parity for 7 = 1,2. Suppose that 6 ¢ (2). We have 


ab = 


142 + bb2D + (agb; + ayb2)VD ~9 t+y/D Heat yVD 
4 2 
where x,y € Z are of the same parity. Thus, 


ayag + by b2D = 4x, (1.28) 


and 
ab + a bo = Ay. (1.29) 


Multiplying (1.29) by a and subtracting b; times (1.28), we get 
by (a? — bf D) = 4(ya, — xb;). 

If a? — b]D is even, then either a; and b; are both odd or both even. In the former case, 
1=a?=hD=5 (mod 8), 


a contradiction, so they are both even. Hence, 


a (s2egew?) 0, 


so (2) is a prime 0 p-ideal by Definition 1.14. If bz is even, then by (1.29), 2 | a2by. If 2 | aa, 


then 
eee (224 sen) 0), 


2 


contradicting our initial assumption. Hence, b; is even and so a, is even since they must be 
of the same parity. As above, this implies that a € (2). Thus, (2) is prime. This completes 
Case 1.7. 


Case 1.8 p= 2 and D = 2(mod 4). 


Let P = (2, VD), which is an Op-ideal by Exercise 1.61 on page 54. Moreover, P? = (2), 
by the multiplication formulas on page 48 with a3 = 1 and g = 2. 


Case 1.9 p= 2 and D=3(mod 4). 
Let P = (2,1+ VD), which is an Dp-ideal by Exercise 1.61. Moreover, as in Case 1.8, 


p22). 


This completes all cases. 
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Remark 1.24 Although we have not developed the full decomposition theory for ideals 
in general number fields, we will be able to talk about decomposition of ideals in quadratic 
fields. The following terminology will be suited to the more general case—see §5.1—so we 
introduce it here. Suppose that F = Q(VD) is a quadratic number field,A p is given as in 
Definition 1.33 on page 46, and (Ar/p) denotes the Kronecker symbol—see Definition A.25 
on page 343. If p € Z is a prime, then 


A 
(p) is said to split in F if and only if (=£) =, 
Pp 


A 
(p) is said to ramify in F if and only if (=£) =0, 
Pp 


and K 
(p) is said to be inert in F if and only if (=) =—1. 
Pp 


Note, as well, that from the proof of Theorem 1.30, when (p) = ?1P2, namely when (p) splits, 
then Py» is the conjugate of P;. This means that if P, = (p,b+WD) then Pz = (p,—b+ VD). 


Example 1.31 In Example 1.30 on page 48, withA - = 40, we saw that 


(3) = In = (3,1 + V10)(3, -1 + V/10), 


Ar\  (40\ _ 1 
3) \3} ° 
so (3) splits in Q(/10) into the two prime Z[V10]-ideals I, and Ip. 


In Examples 1.19 on page 15 and 1.20 on page 16, we saw that (2) is not a prime ideal in 
Z{i| and that (3) is a prime Z[i]-ideal. Since (2) = (1 +7)”, where 


where 


P=(2,14+) =(4+0)=(2,1-) =(1-9 
is a prime Z/i]-ideal, then (2) is ramified in F = Q(i), where 
Ar —4 
eG) 


Also, (3) is a prime ideal and we see that 
Ar —4 
— = — = — if 
(F)-(@)-3 


The following illustration shows that the converse of Lemma 1.1 on page 17 does not hold 
in general and that the multiplication formulas, on page 48, do not necessarily hold if we 
do not have the ring of integers of a quadratic field in which to work. 


so (3) is inert in F. 


Example 1.32 If R = Z[V5], then J = (2,1 + V5) is an R-ideal by Exercise 1.57, and 
clearly (2) = (2,2VW5) C I. If I | (2), then there exists an R-ideal J such that (2) = IJ. 
Thus, J has a representation J = (a,b + cVD) with a,c € N, b € Z, 0 <b <a, such that 
c| a, c| b, and ac | (b? —c?D). Moreover, J | (2), so by Lemma 1.1, (2) C J, so there exist 
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x,y € Z such that 2 = ax+(b+cV/D)y. Therefore, y = 0 and a | 2. Ifa =1, then I = (2), 
which means that 1+ V5 € (2), a contradiction, so a = 2. If b= 1, then ¢ = 1, so 


Yes (2), (1.30) 
However, by considering the multiplication of basis elements for J we see that 
P? = (4,2(1+ V5),6 + 2V5) = (4,2(1 + V5), 


where the last equality follows since 6 + 2\/5 is a linear combination of the other basis 
elements so is redundant. Thus, 


P = (4,2(1+ v5) = (2)(2,1+ V5) = (2), 


and combining this with (1.30), we get (2) = (2)I, which implies 2(1 + V5) € (2), again 
a contradiction. We have shown both that although (2) C J, I does not divide (2), and 
that the multiplication formulas for ideals in R fail to hold. Note, that R is not the ring 
of integers of a quadratic field by Theorem 1.28 on page 45. Indeed, by Corollary 1.7 on 
page 27, R is not a Dedekind domain. For instance, (2) is a prime R-ideal but, by the above, 
is not maximal, contradicting part (B) of Definition 1.23 on page 25. (R is what is known 
as an order in Op = Z[(1+ V5)/2] for F = Q(V5) and I is an example of an ideal in R 
which is not invertible in R—see [49, Chapter 1, pp. 23-30]. As we saw in Theorem 1.16 
on page 27, all integral ideals in a Dedekind domain are invertible. Thus, the multiplication 
formulas work in Op, but not in arbitrary orders where invertibility is not guaranteed.) 


Exercises 


1.55. Suppose that G is an additive abelian group, and that R is a commutative ring with 
identity 1z which satisfy each of the following axioms: 

(a) For each r € Rand g,h eG, r(g+h) = (rg) + (rh). 

(b) For each r,s € R and g € G, (r+s)g = (rg) + (sg). 

(c) For each r,s € R and g € G, r(sg) = (rs)g. 

(d) For each g € G, lr-g=g. 

Then G is a (two-sided) module over R, or for our purposes, simply an R-module. 


Prove that (in general) being a Z-module is equivalent to being an additive abelian 
group. 
1.56. Let R = Z[wp], D € Z not a perfect square, and wp = (sc —-1+ VD)/o, with o = 1 


if D 4 1(mod 4) and o = 2 otherwise—see Application 1.2 on page 3. Then every 
Z-submodule of R has a representation in the form 


I =[a,b+ cwp] 


where a,c € N and b € Z with 0 < b < a. Moreover, a is the smallest natural number 
in I and c is the smallest natural number such that b+ cwp € I for any b € Z. (Note 
that when c = 1, I is called primitive.) 


1.57. With reference to Exercise 1.56, prove that I = (a,b+ cwp) is an R-ideal if and only 
ifc|a,c|b, and 
(ob + c(a —1))? =D (mod o7ac). (1.31) 


(Note that we use the square brackets for Z-modules and the round brackets for 
ideals.) 
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1.60. 


1.61. 


1.62. 
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. With reference to Exercise 1.56, prove that the Z-module [a,b + cwp] for a,c € N, 


b € Z, is an R-ideal (a, b+ cwp) if and only if c | a,c | b, and (1.31) is satisfied. (Here 
a is the smallest natural number in J, called the norm of I, denoted by N(I).) 


. Let [a,8] = aZ + BZ and [7,6] = yZ+ 6Z be two Zmodules, with a, 8,7,6 € R, 


where R is given in Exercise 1.56. Prove that [a,] = [7,6] if and only if 


(5) =*(9) 


where X € GL(2,Z), which is the general linear group of 2 x 2-matrices with entries 
from Z, namely, those 2 x 2-matrices A such that det(A) = +1, also called unimodular 
matrices. (Note that, in general, GL(n, Z) is the general linear group of nxn matrices 
with entries from Z.) 


With reference to Exercise 1.56, prove that if a € R, and I = (a,a) is an R-ideal, 
then I = (a,na+ a) for any n € Z. 


Let F be a quadratic number field and let P = (p, (b+ /Ar)/2) be an D p-ideal where 
p€N is prime. Prove that P is a prime 0 p-ideal. 


Verify the multiplication formulas on page 48. 


Biography 1.4 Ludwig Stickelberger (1850-1936) was born on May 18, 1850 
in the canton of Schaffenhausen, Switzerland as the son of a pastor. In 1867 he 
graduated from a gymnasium,*® from which he went to study at the University 
of Heidelberg. Later he went to the University of Berlin to study under Karl 
Weierstrass (1815-1897), achieving his doctorate in 1874. His thesis topic was 
on transformations of quadratic forms to a diagonal form. Also, in 1874, he 
received his Habilitation from Polytechnicum in Zurich (now ETH Zurich)—see 
Footnote 1.1 on page 23. In 1879, he was appointed extraordinary professor in 
the Albert Ludwigs University of Freiburg, and was promoted to full professor 
in 1919. In 1924 he returned to Basel. Although he had married in 1895, both 
his wife and son died in 1918. He died on April 11, 1936, and was buried beside 
his wife and son in Freiburg. 


His publication output was at best modest, but his contributions may be 
characterized as important contributions to linear algebra, and algebraic num- 
ber theory, including the theorem that bears his name—see Remark 1.22 on 
page 46. He coauthored four published papers with Frobenius, three of them 
on elliptic functions. Stickelberger is best known for two papers. The first, 
Verallgemeinerung der Kreisteilung, generalizes results of Jacobi, Cauchy, and 
Kummer on Gauss and Jacobi sums. He used these results to find annthilators 
of class groups of abelian extensions of Q. The other, Uber eine neue Eigen- 
schaft der Diskriminante, shows that the Legendre symbol (S2) =(-l1)"9, 
where the number field F’ has degree n over Q, and g is the number of prime 
ideals in Or above p. The latter result implies the quadratic reciprocity law. 
The results in both papers have been generalized over the years. 


*The Gymnasium in the German education system, is a form of secondary school with 
a pronounced emphasis on academic achievement. This is comparable to the British former 
grammar school system or with prep schools in the United States. 


Chapter 2 


Field Extensions 


Good old Watson! You are the one fixed point in a changing age. 
spoken by Sherlock Holmes in His Last Bow (1917), title story. 
Sir Arthur Conan Doyle (1859-1930) 
Scottish-born writer of detective fiction 


In this chapter we explore in greater detail the notion of an algebraic number field intro- 
duced in Definition 1.29 on page 35, via generalizations thereof, which we develop in §2.1. 
In particular, this is a foundation for Galois theory, and a generalization of prime decom- 
position motivated by our coverage of the quadratic case in $1.7, which we generalize to 
arbitrary number fields in §5.1. 


2.1 Automorphisms, Fixed Points, and Galois Groups 


Given a number field F’, it is possible to define an embedding as a ring monomorphism 0 of 
F into C—see Definition A.10 on page 327, and the surrounding discussion, for background. 
Also, the reader should solve Exercises 2.1—-2.6 on pages 62-63 as a precursor, motivator, 
and adjunct material to the following. 

Definition 2.1 — Fixed Points and Isomorphisms 


Let kK C k, be two fields and let 0 be an embedding of K into k,. Then a € K is called 
a fixed point under 6 if O(a) =a. 


Remark 2.1 The name “fixed-point” is appropriate since, in the case where K = Kj, 6 
is an automorphism, so 6(@) = a = 6;(a), where 6; is the identity automorphism of K, 
namely, 0;(3) = 8 for all 6 € K. The set of all fixed points has a special designation. The 
reader should be familiar with the material surrounding Remark A.4 on page 327 for the 
following. 


Lemma 2.1 — Fixed Fields 
If K is any field, then 


F={6€K:6(6) =6 for all 6 € Aut(K)} 


is a field, called the fixed field of Aut(K). 
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Proof. We have for any automorphisms 6,7 of K, and any 6,7 € F, 
A(8 £7) = 0(8) £O(7) = 7(8) £7(y) = 7(B £7). 


Also, 

O(By) = 0(8)0(y) = 7(8)r(q) = (89). 
Since 0(6) = 7(8), then 

A(B)* = 7(B)* = r(8") = (8). 


Hence, sums, products, and inverses of fixed points are fixed points, so F’ is a subfield of 
K. 


Lemma 2.2 Distinct embeddings of a field K into a field Ky, are independent. In other 
words, if 0; are distinct embeddings of K into Ky, and 6; € K for 7 = 1,2,...,n, such that 


S° 6;(a)B; =O foralaek 


j=l 


then 6; = Bz =--- = Bn =0. 


Proof. We use induction on n. If n = 1, the result is clear, since 0; cannot be the zero map. 
Assume that the result holds for all natural numbers k <n. If 


S > 6; (a) 8; =0 
j=l 


for alla € K, and 8; £0 for some j, then 6; ¢ 0 for all j = 1,2,...,n, by the induction 
hypothesis. We may multiply through by 871 to get 


n-1 


On(a) + ¥- 6;(a)B;8,,1 = 0. (2.1) 


j=l 
Since the 6; are distinct, there exists some 7 € K such that 61(7) 4 @n(y). Now multiply 
(2.1) through by 0,(y)~! to get 


n-1 


An(@)On(7)* + >| 8;(a)On(7) "Bi Bq * = 0. (2.2) 
j=l 
Since (2.2) holds for all a € K, we may replace a by ya therein to get 


n—-1 


An(a) + S~ 6;(a7y)9n(7)* Bn 1B; = 0, 


j=l 


so 
n-1 


An (ae) + S> 0;()0;(7)On(7) 1 Bn By = 0. (2.3) 


j=l 
Now subtracting (2.3) from (2.1), we get 


n—1 


S$ 0;(a)8;B;,1(05(1)On(7)~1 — 1) =0. 


j=l 


However, 01(7)0n(y)~' — 1 40, since 6:(y) 4 On(y). This provides a dependency relation 
that contradicts the induction hypothesis, so 8; = 0 for all 7, and the result is complete. 
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Theorem 2.1 — Degrees Over Fixed Fields 

If 0,,...,0, are distinct isomorphisms of a field K into a field K,, and if F is the fixed field 
of {6,,...,0,}, then |K,: F| > n. 

Proof. If |K, : F| = m <n, then let Ki = F(aj1,...,@m) and consider the system of 
homogeneous equations for 7 = 1,...,m: 


n 


S- 0; (a4) x; = 0. 


j=l 


Since m <n, then by Theorem A.23 on page 338, there must exist solutions x; € K, not 


all zero, to these equations for 7 = 1,...,n. Also, for any y € Ky, there exist 8; € F such 
that 

m 

> Bias = 

j=l 
Now, for each 7 = 1,...,m, we have 


n 


01(8:) S- 0;(a4)2; = 0. 


j=1 


Then, since 6; € F, we have 0;(6;) = 6;(6;). Thus, 


n m n 


SO Co)e => OY Biaap= ¥ Oey. 


j=1 i=l j=l i=l gal: 


We have exhibited a nontrivial dependency relationship between the 6;, contradicting 
Lemma 2.2. 


Corollary 2.1 If 0,,...,0, are distinct automorphisms of a field K, and F is the fixed 
field of Aut(K), then |K : F| > n. 


In Exercise 2.6 on page 63 we introduce the notion of an F-isomorphism of a number field 
kk. We now generalize this notion. 


Definition 2.2 — Fixing Automorphisms 


Let K/F be an extension of fields. If @ is an automorphism of K such that 6(a@) = a for all 
a € F, then @ is said to fiz F,, or to leave F fixed, and is called an F'-automorphism of K. 


Lemma 2.3 — Groups and Fixing Automorphisms 


Let K/F be an extension of fields. The set of all F-automorphisms of K forms a group, 
denoted by Aut (Ic). 
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Proof. Two F-automorphisms 6}, 62 of a field K may be composed by defining 
9102(8) = 01 (62(8)) 


for each 6 € K. Then this product is also an automorphism of K. Also, if 6(a) = 8 for 
a given F-automorphism @ of K, we define 9~'(8) = a as the mapping that takes 6 to 
a, called the inverse of 6, which is also an F-automorphism of kK. Thus, for any two F- 
automorphisms 6; and 62 of kK, 6105 *(8) = 6 for any 6 € F, so 0105 * is an F-automorphism 
of kK. Thus, the set of all F-automorphisms of K forms a multiplicative abelian group. 


Although it is possible for Aut (Ic) to be infinite, the situation considered in this text for 
number fields will always deal with a finite group. Also, in general it is possible for the 
fixed field of Autr(Ic) to be bigger than F’, as illustrated by the following. 


Example 2.1 Let K = Q(W/2) C R and F = Q. Then Autp(K) = {6;}, the identity 
group consisting of only 0, which is the identity automorphism that fixes K pointwise. The 
reason is that the only possible images of W/2 are (3/2 ¢ K and (3 W/2 ¢ K, where ¢3 is 
a primitive cube root of unity, so both are images in C — R, since «* — 2 = 0 has roots 


x = V2, the only real root, as well as 7 = C3 /2, 3 2’, the only complex roots. 


The case where F is the fixed field of Aut(K) is of particular importance. Thus, we now 
seek to minimize the bound on the degree given in Theorem 2.1. The following, due to 
Artin, essentially generalizes Exercise 2.6 on page 63—see Biography 1.2 on page 24. 


Theorem 2.2 — Unique Automorphism Groups 


Let K/F be an extension of fields and let G be a finite group of automorphisms of Kk having 
F as its fixed field. Then 
|: F| =|G| = |Autp(K)], 


and G = Autr(K). 
Proof. Let G = {01,...,0n} with identity automorphism 6). If |K : F| > n, then there 
exist a; € K for 7 = 1,...,2 +1 such that the a; are linearly independent over F. By 


Theorem A.23 on page 338, there exists a nontrivial solution for k = 1,...,n +1 to the 
system of equations 


S > 6,(aj)2; = 0, where x; € K for j =1,...,.n+1. (2.4) 


If there is a solution with all x; € F, then 6;(a1) =a, = — pas 6,(a;)x;, contradicting 


the assumed linear independence. Thus, at least one of the values x; of any given solution 
cannot be in F’. Now select a solution set 


{as}j1 = {851 


in which there is a maximum possible number of nonzero elements, namely let m < n be 
the largest natural number such that 


Bm+1 =-+ = Bn4i =9 


and 6, # 0 for any r < m. If m = 1, then since 8,6;(a1) = 0 and @:(a1) = a1 4 0, 
then 6, = 0, a contradiction to the definition of m. Thus, m > 1. Also, without loss of 
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generality, we may select 3,, = 1, since we may multiply through by 67! to get another 
solution. Hence, for k = 1,...,n we have 


Y44(a4)3; = Ox(am) + D> Au(a)8, = 0. (2.5) 


Without loss of generality, we may assume that 6, ¢ F’. Therefore, there exists 0¢ such that 
00(61) A 6, for some = 1,...,n. 
Applying 02 to (2.5), we get 


m-1 


90;(Qm) + SS 6¢0;(a;)0e(8;) = 0, 


j=l 
fori =1,...,n2+1. Since 6:6; = 0, for some 1 = 1,...,n, this equation becomes 


m-1 


Ax(Qm) + S~ O5.(0%j)00(8;) = 0. (2.6) 


j=l 
Subtracting (2.6) from (2.5), we achieve 


m1 


S~ G4 (aj) (Bj — 8e(B;)) = 0. 


j=l 


Since 6(31) # (1, this is a solution to (2.4) having less than m nonzero elements, con- 
tradicting the minimality of m. We have shown that |K : F| < n, and by Theorem 2.1, 
|k : F| > n, so we have equality. Also, if there exists a 6 € Autr(K) such that 0 ¢ G, 
then there are n+ 1 distinct automorphisms of K which fix F’. Therefore, by Corollary 2.1, 
|k: F| >n+1, a contradiction. Thus, Autp(K) =G. 


The following encapsulates what is contained in Theorem 2.2—see Biography 2.1 on page 64. 


Definition 2.3. —— Galois Groups 


The uniquely determined group in Theorem 2.2 is called the Galois group of the field 
extension K/F' that is called a Galois extension, and Autr(K) is denoted by Gal(K/F). 


The above development is essentially due to Artin. However, we have a parallel development 
for the number field case for comparison, and will give a broader overview, in Exercises 2.1— 
2.6 on pages 62-63. 


The following links the above with the number field case and shows that the group in Defi- 
nition 2.3 is the one satisfying the following equivalent conditions. The following also holds 
in the case where the fields are finite or are any finite extensions of fields of characteristic 
zero—see Exercises 2.12—2.16 on page 64. The result is a preamble to the fundamental 
theorem for Galois theory. 


Theorem 2.3 — The Galois Group of a Number Field 
Let K/F be an extension of number fields. Then the following are equivalent. 


(a) The fixed field of G = Autr(K) = Gal(K/F) is F and |G| =|K: F|. 
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(b) For any a € K, mg,r(x) has all its roots in K. 
(c) K = F(a1,a2,...,aa) where a; are roots of some f(x) € Fla]. 


Proof. If (a) holds, then let 


h(x) = [] (@ -o()) € K [2]. 


o€G 


However, the elements of G permute the factors of h(a), so h(x) remains invariant under 
the action of G. However, since 0 = 1g € G, a—1g(a) = 0 is a factor of h(a), namely a is 
a root of h(a). Thus, by Theorem 1.23 on page 38, ma,r(x) | h(a), so all roots of ma, r(x) 
are in kX. Hence, (a) implies (b). 

Assume (b) holds. By Exercise 1.51 on page 43, there is an element y € K such that 
K = F(y). Since we are assuming that m,, r(x) has all its roots in K, then K is generated 
by the roots of m,,7(x) since K = F(a(7)) for any o € G. We have shown that (b) implies 
(c). 

To complete the logical circle, we show that (c) implies (a). (For the proof of this part, 
the reader should be quite familiar with Exercise 2.6 on page 63. In particular, be aware 
of the distinction between the notion of an F-automorphism and an F-isomorphism. The 
former implies the latter, but, as Example 2.1 on page 58 shows, in general the latter does 
not necessarily imply the former.) If o is an F-isomorphism of K, then o(a;) = ax where 
j,k © {1,2,...,d}, from which it is clear that c(i) = K, so o is an F-automorphism of K, 
namely o € Autr(K). By Exercise 2.6, the number of F-automorphisms of K is exactly 
|k : F| =d. Suppose that G = Autr(K) fixes 6 € K. Then every element of G is an 
F'(6)-automorphism of K. By Exercise 2.6 again, the number of F'(0)-automorphisms of 
is exactly |k : F(5)|. Hence, d < |K : F'(d)| which forces d = |K : F(6)|, namely 6 € F. 
This shows that F' is the fixed field of G, and 


|G| = |Autr(K)| = |K : F| =|Gal(K/F)|, 


which completes the task. 


We conclude this section with the following highlight of Galois theory. See Exercise 2.2 for 
the definition of a normal extension. 


Theorem 2.4 — Fundamental Theorem of Galois Theory 

Let K/F be a Galois extension of number fields with Galois group G = Gal(K/F). If H is 
a subgroup of G, then denote the fixed field of H by &(#), and if L is an intermediate field 
in K/F, let g(L) = Aut, (K). Then 


(a) The mappings g: L++ g(L) from intermediate fields to subgroups of G, and k: HH 
k(H) from subgroups of G to (intermediate) fixed fields are inverses of one another. 
Also, 

k( Ay) Cc k( A) if and only if g(k(H1)) = Ay =) Ay = g(k(A2)), 


namely, they are inclusion reversing. Furthermore, 


|k(H2) : k(A4)| = |Hy : Ad|. 


(b) K is Galois over any intermediate field L. Also, L is Galois over F' if and only if 
g(L) = Aut, (K) is normal in G. If the latter occurs, then 


Gal(L/F) = at 
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Proof. Let L be an intermediate field between K and F, and let a € K. Then my, (2) | 
Me,F(£), all of whose roots are in K by part (b) of Theorem 2.3. Therefore, all roots of 
Me,L(£) are in K, so by part (b) of Theorem 2.3 again, K/L is Galois. By part (a) of 
Theorem 2.3, DL is the fixed field of g(L) = Aut,(). In other words, k(g(L)) = L. On 
the other hand, if H is a subgroup of Gal(K/F), then H = Gal(K/k(H)). In other words, 
H = g(k(H)). We have shown that k and g are bijections and inverses of one another. 
Lastly, the inclusion reversal is now clear, so we have (a). 

Let LZ be an intermediate field between K and F, and let a € L. Then mg,r(f) = 0 
where 6 € K if and only if 6 = 6(a) for some 6 € Gal(K/F). By part (b) of Theorem 2.3 
one more time, L/F is Galois if and only if 0 € Gal(K/F), namely if 0(L) C L. Now, if 
0(L) CL, o € g(L) and ae EL, then 


0~*a0(a) = 0-'0(a) =a, 


so 0~-1a@ € g(L). We have shown that if L/K is Galois, then g(L) is normal in G. Con- 
versely, assume that g(Z) is normal in G. Ifa € K,0€ Gand o € g(L), then 


o0(a) = 00-'c6(a) = O(a), 


since 0~!a@ € g(L). Thus, (a) is fixed by g(L) so 0(a) € L. We have shown that if g(L) 
is normal in G, then 0(L) C L. Hence, L/F' is Galois. 
Finally, we establish the isomorphism given in (b). Let H = Gal(L/F). Since 0(L) C L 
for all 0 € Gal(K/F), 
6, € Aute(L) = Gal(L/F). 


Thus, the restriction mapping 0 + 6], is a homomorphism of G to H with ker(@|z) = g(L). 
Since 


Hl- |K:F| |G 
AL = K:L| |g(L)|’ 
Jk: DL] |g(Z)| 
then the restriction homomorphism is surjective, so 
G 
A >= —, 
g(L) 


which completes the proof of the fundamental theorem. 
The following diagram illustrates what Theorem 2.4 asserts. 


Diagram 2.1 

The mapping g: The mapping k: 

Fields Groups Fields Groups 
k — > 1 k(1) +—— 1 
U| al Ul M! 
EL ——+ g(L) k(H) «—— H 
U| al Ul al 
M ——-+ g(M) k(J) +— J 
U| al Ul al 


[roe MG) -—-G 
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Theorem 2.4 asserts that there is a one-to-one correspondence between the subgroups H of 
Gal(K/F) and the intermediate fields L, corresponding elements H and L being such that 
L=k(H) and H = g(L). This elegant relationship will be used in force in §5.4. 


Exercises 


2.1. Let a be an algebraic number. Prove that if F = Q(qa) is an algebraic number field of 
degree d over Q, there exist exactly d embeddings 9; of F into C for 7 = 1,2,...,d. 
Conclude that 6;(@) = a,j, for 7 = 1,2,...,d are precisely the roots of the minimal 
polynomial mq,9(x) of a over Q. 


(Hint: See Theorem 1.23 on page 88, Theorem 1.24 on page 89, and Application A.1 
on page 325.) 


(The elements 0;(a) are called the conjugates of a, which is a generalization of the 
concept for quadratic extensions introduced in Example 1.29 on page 46. Moreover, 
the fields Q(a;) are called the conjugate fields of F. Also, a; for 7 = 1,2,...,d 
are called the complete set of F-conjugates of a and Q(a;) for such j are called the 
conjugate fields of F. Thus, the F-conjugates of a do not depend on the choice of a 
such that F = Q(a). Note that if Q(a;) C R for all F-conjugates of F, then F is 
called a totally real field and if Q(a;) C C—R, then F is called totally complex.) 


Exercises 2.2-2.6 all refer to Exercise 2.1 and are intended to develop the notion of embed- 
dings of number fields to complement the topic in this section. 


2.2. We define the field polynomial of a over F' to be 


fa.e(t) = [](«- 04(a)). 


j=l 
Establish each of the following. 


(a) Let 6 € Q(a) be an algebraic number of degree s over Q. Then d/s =t € N and 


fa,r (2) = (mp,e(2))*. 


Conclude that 6;(8) for 7 =1,2,...,s are the roots of mg,o(x), each repeated t 
times in the factorization of f(a) € Q|z]. 


(b) If F = Q(a) is a number field of degree d over Q and there are exactly s distinct 
conjugate fields Q(a;) = F, then d/s = t € N and each distinct field occurs t 
times. 


(Hint: To establish t © N, see (A.2) on page 325. For the balance, employ Theo- 
rem 1.23 on page 88 and Definition A.15 on page 331.) 


(When s = 1 in part (b) above, the field F is said to be normal over Q. When we 
are dealing with a field of characteristic zero or a finite field, then being a Galois 
extension is tantamount to being a normal extension—see Definition 2.3 on page 59. 
In the more general case, with which we will not be concerned herein, we refer the 
reader to [29], where one may also find a proof of the last assertion.) 


2.3. Prove that for an algebraic number field F’ with a € Op, all of the F-conjugates of a 
are algebraic integers. 


2.4. Prove that if a is in a number field F’, then all F-conjugates of @ are equal if and only 
ifa~aEQ. 


2:1: 


2.5. 


2.6. 


2.7. 


2.8. 


2.9. 
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Prove that if @ is in a number field F’, then all the F-conjugates of a are distinct if 
and only if F = Q(a). 

(Via Exercise 2.5 and in view of the comments made in Exercise 2.2, we see that when 
all F-conjugates a; of a are distinct, then Q(a) = Q(a;) for all such j, namely F is 
Galois over Q. Another way of putting this is that every polynomial f(x) € Fa] which 
has a root in F' splits completely into linear factors, meaning that F is a splitting field 
for f—see Definition A.17 on page 334.) 


Let £/F be an extension of number fields and let 6 be an embedding of FE into C that 
fixes F’ pointwise, namely 0(f) = f for all f € F. Then @ is called an F-isomorphism 
of E. If 6 is an F-isomorphism of E = F(a), then 6(q) is called a conjugate of a over 
F. Prove that every embedding of F' in C extends to exactly |E : F'| embeddings of 
E in C. Conclude that there are |E : F| F-isomorphisms of E. 


(Hint: Use induction and employ (A.2) on page 325 together with Theorem 1.24 on 
page 39.) 


(This exercise deals with one of the classic questions in the theory of field extensions, 
applied to our number field case. If 80 is an isomorphism of a field F and E is a field 
extension of F', when can @ be extended to an isomorphism of E'? Putting it another 
way, when can we find an isomorphism ¢ of E such that ¢|r = 0?—-see the discussion 
surrounding the defining notation (A.5) on page 827 for a reminder of restriction 
maps and Theorem A.15 on page 334 for extensions of isomorphisms.) 


Let a@ be an algebraic integer and suppose that |Q(a) : Q| = 2. Prove that 


Q(a) = Q(Vd) for some squarefree d € Z. 
Find the minimal polynomial of 
a= 4/-2—3/=5 
over Q and determine Gal(/Q) where kK = Q(a). Conclude that 
JK: Q\=4. 
Let n, 4 ng be squarefree integers. Prove that 


K = Qin + Vn2) = Q(Vn1, Vina), 
and determine Gal(i/Q). 


. For n; € Z be squarefree, distinct, and n; 4 1 for j = 1,2. Prove that 


QV + V/n2) : Q| = 4. 


. With reference to Exercise 2.1, suppose that F’ is a number field with embeddings 6; 


such that 6;(F) C R for 7 = 1,2,...,r1. These are called the real embeddings of F. 
The remaining embeddings 0,;(F’) C C —R for j = 1,2,...,r2, are called the complex 
embeddings of F. Show that |F : Q| = r; + 2rg. In this case {r1,1r2} is called the 
signature of F. 


. Prove that the signature, defined in Exercise 2.11, of Q(W2) is {r1,r2} = {1,1}. Show 


that Q(/2) is not Galois over Q. 
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2.13. 


2.14. 


2.15. 


2.16. 
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If F is a field of characteristic p, and n € N, prove that the map given byo: FwF 
defined by a+ a?" is an F,, automorphism of F’. 


Let D; C Dp be integral domains, a € D2, and let f(a) € Dy[x] with deg(f) > 1. 
Establish each of the following. 

(a) (x-a)? | f(a) if and only if f(a) =0 = f’(a). 

(b) If D, is a field and gced(f(x), f’(a)) =1, then f has no multiple roots in D2. 


(c) If D, is a field, f(a) is irreducible in D,[x], and D2 contains a root c of f(x), 
then f(a) has no multiple roots in Dg if and only if f’(c) 4 0. 


(d) If deg(f) =n EN, then f(x) has at most n roots in Dp. 


Let F be a finite field with p” elements. Then F is a splitting field, unique up to 
isomorphism, of «?” — x over F,. 


(Hint: Use Exercises 2.138-2.14.) 


Prove that Theorem 2.3 on page 59 holds for fields of characteristic zero and for finite 
fields. Also, show that if K/F is a finite extension of finite fields, then K’/F is a Galois 
extension with Gal(A/F’) being cyclic. 

(Hint: Use Corollary A.10 on page 834 and the discussion surrounding it, as well as 
Theorem A.16 on page 334 for the first statement. For the second statement, use the 
first statement in conjunction with Exercises 2.13-2.15, ) 


Biography 2.1 Evariste Galois (1811-1832) was born on October 25, 1811 
outside Paris in the village of Bourg-la-Reine, where his father was mayor. In 
1830, he submitted a paper to the Académie des Sciences. Fourier, who was 
secretary of the Académie, took the paper home, died shortly thereafter, and 
the paper was lost. This was not the first misfortune, since in the previous 
year he had submitted a paper to the Académie through Cauchy, who also 
lost that paper. Galois again tried to submit a paper to the Académie, this 
time through Poisson, who rejected the paper as incomprehensible. This paper 
contained the foundations of what we now call Galois theory. Due to his in- 
volvement in the revolution of 1830, Galois was imprisoned. After his release, 
he became involved in a pistol duel, allegedly a politically motivated suicide, 
and was shot through the intestines. Although he was taken to a hospital, 
he died the next morning on May 31, 1832, from peritonitis. He was not yet 
twenty-one. For a detailed explanation of his life and “pointless death” see [62], 
dedicated to an accounting based on reliable historical documents, rather than 
the mythologized and inaccurate descriptions often found in the literature. 


After his death, Galois’ papers made their way ultimately into the hands of 
Liouville. In September of 1843, Liouville announced to the Académie that he 
found Galois’ work to be correct, concise, and deep. Liouville published Galois’ 
papers in his journal in 1846. Galois’ work, relating the solving of equations by 
radicals to the group of the equation, is of fundamental importance, and may 
be said to have led to an arithmetical approach to algebra. 
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2.2 Norms and Traces 


But all things must come to dust eventually. No human being, no system, no age is 

impervious to this law; everything beneath the stars will perish; the hardest rock will 
be worn away. Nothing endures but words. 

—spoken by Tiro, a Roman scribe, in Lustrum, 

by Robert Harris—see [26, Page 11]* 


*Although [26] is essentially a work of fiction, Marcus Tullius Tiro actually existed and was a 
secretary to the Roman orator and statesman Cicero. Indeed, Tiro wrote the book The Life of Cicero, 
which disappeared after the fall of Rome along with most of his literary output. Tiro ostensibly lived 
to be over one hundred years old and his (shorthand) method of recording has elements that survive 
to this day including the symbol &, for instance. His method, known as Notae Tironianae or more 
commonly the Tironian system of shorthand, was taught in Roman schools and enjoyed widespread 
use over several centuries. 


We introduce some concepts in this section that will be crucial in the development of the 
theory of integral bases and discriminants in §2.3. In §2.1, in particular Exercises 2.1—2.6 
on pages 62-63, we discussed embeddings of an algebraic number field in C. We now use 
this notion to define two fundamental concepts. 


Definition 2.4 — Norms and Traces 


Let F' be an algebraic number field of degree d over Q, and let 0; for 7 = 1,2,...d be the 
embeddings of F in C. For each element a € F’, set 


called the trace of a from F’,, and set 
d 
Ne(a) = [[ (2), 
j=l 
called the norm of a from F. 


The definition of norm and trace was first given by Dedekind in 1871—see Biography 1.3 
on page 29. By Exercise 2.17 on page 68, Tr is additive, and Nr is multiplicative. We will 
substantially generalize Definition 2.4 later—see Definition 5.2 on page 184. 


Example 2.2 Let F = Q(V13), a=1+V13, and 6 = (3+ V13)/2. The embeddings of 


F in C are a _ 
6, : V¥13 + V13, and 62: V13 4 —V13, 


fixing Q pointwise, namely the Q-isomorphisms of F’. Here, 
Nr(a) = 61(a)02(a) = (1 + V13)(1 — V13) = —12, 


aoa (2) ae 


Npr(B) = 01(8)02(B) = ( 5} 2 


Tr(a) = 61(a) + 02(a) = (1+ V13) + (1 — V13) = 2, 
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and 
Te (8) = 6:(8) + 60(8) = 2+¥E , 3 VIB _ 
Also, 
Nr(af) = wr (+ vim (24378)) — pie avi = 
8? — 4-13 = 12 = (—12)(-1) = Nr(a)Np(8), 
and 


Tr(a+ 6) =Tpr (u Vi) (3%)) eis (38) 7 


5 =2+3=Tr(a) + Tr(8). 


Example 2.2 illustrates some general properties of norms and traces. 


Theorem 2.5 — Properties of Norms and Traces in Subfields 
Let F' be an algebraic number field of degree n over Q, and a € F with |Q(a) : Q| =d. If 
Q = Q1,09,...,a¢ are all of the conjugates of a over Q, namely the roots of ma, r(x), then 


and 
d 
Tes} = Wow (ay"/4. 


Furthermore, 


Me,o(L) = gt To(a) (a)x¢-1 freee No(a) (a). 
Proof. Let the embeddings of Q(a) in C be given by 
gj(a)rra; (1<j <4), 
where ¢;(q) = 4 for all g € Q. Thus, by Definition 2.4 on the previous page, 


d 


d 
To(a) (a) -_ os a,j, and Nova) (a) — II Qj. 


j= 1 


&. 
Il 
nn 


By Exercise 2.6 on page 63, each of the ¢;, for i = 1,2,...,d, extends to exactly n/d 
embeddings of F' in C, which we will denote by 


0, for j =1,2,...,n/d. 


Therefore, 
d n/d 


d d 
= SOY MM) = Yr Far= Fa 
i=l i=1 


i=1 j=1 


2.2. Norms and Traces 67 


and 


d nid d d rage 
Ne(a) =] [J 9 (@) = [J a?’ = (Il-:) ; 


i=1j=1 i=l 


Finally, in the expansion of 
Ma,Q(x) = [[@ — aj), 


we see that the constant term must be 


d 
= II a= Noga) (a), 
t=1 


whereas the coefficient of «¢~! must be 


This completes the proof. 


Corollary 2.2 If a € F, an algebraic number field, then 
Tr(a) € Q, and Nr(a) €Q. 


Proof. By Theorem 2.5, we need only show that Ng(a)(@), Toa)(@) € Q. However, this is 
immediate since, by the theorem, 


Ma,o(z) = et — Too) (a)a** +++++ Nea)(@) € Q[a], 


which secures the result. 


Corollary 2.3 Let a € N, and let ma,g(x) be the minimal polynomial of a over Q. Then 
a € A if and only if ma,q(#) € Z[z]. Furthermore, if a € A, then 


Tr(a) € Z, and Nr(a) € Z. 


Proof. Suppose that me,o(x) € Q|z] where a € A, and a is a root of a monic polynomial 
f(z) € Z[x] of least possible degree. Then ma,g(x) | f(x) in Q[z] by Theorem 1.23 on 
page 38. However, since ma,g(x) is monic, then by Gauss’s Lemma A.1 on page 332, we 
must have mq,9(x) € Z[z], so f(x) = ma,g(x). Conversely, if me,e(x) € Zia], then a € A 
by definition. 

To prove the final statement we note that by Theorem 2.5, 


Me,Q(L) = gt — To(c) (a)x4-1 +--+ Nova) (a), 


and by the above ma,g(a) € Z when a € A, so the result follows. 


The notions of trace and norm are also linked to the discriminant of a polynomial introduced 
in Exercise 2.29 on page 69. The reader will be familiar with the details of the following 
from Example 1.29 on page 46. 
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Example 2.3 Consider the irreducible quadratic polynomial 
f(x) = ax? + be +c€ Qa], 
where a 4 0. As mentioned in Example 1.29, the roots of f(a) are given by 


—b+VA 12 ba 
a= ——— _, and a = ———__, 
2a 2a 
where A= 0b? — 4ac is the discriminant of the quadratic field Q(a) = Q(WA)—see Theo- 
rem 1.27 on page 44 and the discussion surrounding it. Therefore, 


—b+VA —-b-VA 
= —b/a, 
2a 2a 


Tr(a) = Toa) (a) =at a! — 


and 


=c/a. 


Nr(@) = Naa)(a) = aa! = (- i) (* a) 7 a 


Hence, the minimal polynomial of a over Q is Ma,9(x) = 2? — Tr(a)a + Nr(a). 


Exercises 
2.17. With reference to Definition 2.4 on page 65, prove that 
Tr(a + 8) = Tr(a) + Tr(8), and Nr(a8) = Nr(a)Nr(8), 
for alla,@ € F. Also, prove that for any q € Q, 
Tr (qa) = qT p(a), and Np(qa) = q“Np(a). 
(Thus, in particular, if a = 1, then Tp(q) = q, and Nr(q) = q?.) 


2.18. Let n € Z be cubefree (namely p? { n for any prime p). Also, let a = ¢/n, F = Q(a), 
and mMa,9(x) = «3 — n. Find disc(ma,g) by employing Exercise 2.31. Furthermore, 
set 

B=(e? ta+4+1)/3, with n= +1 (mod 9), 


where the + signs correspond as given. Find Tr(6), Nr(8), and mg,9(x). Conclude 
that 6 is an algebraic integer in F’. 


(Fields of the form Q( %n) for cube-free n are called pure cubic fields.) 
2.19. Let F = Q(V7), and a = (1+ V7)/2. Find Nr(a), Tr(a), and mo,Q(2). 
2.20. Prove that there are no elements having norm 3 from Q(V—1). 


2.21. Let F = Q(,/p) where p = +3(mod 8) is prime. Show that there is no a € F such 
that Nr(a) =25 


2.22. Find the minimal polynomial of \/—2 — 3/—5 over Q. 
2.23. Find the minimal polynomial of J/2+ V3 over Q. 


In Exercises 2.24-2.26, we assume that F = Q(¢,) for a prime p. 
2.24. Prove that Tr(¢,) = —1, and Nr(1—¢,) =p. 
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. Prove that Tr(1—¢2) = p, where j € {1,2,...,p— 1}. 


(Hint: Use Example 1.5 on page 2.) 


. Let a be an algebraic integer in F’. Prove that Tr(a(1— ¢,)) € pZ. 

. Let g = (1+ V5)/2 be the golden ratio. Prove that ¢3 € Q(g + ¢3). 

. Prove that g € Q(g + ¢3). 

. Let f(x) = «4 — 2 and let a = V2 be areal root of f(x). Prove that F = Q(a,#) is 


the splitting field for f over Q. See Definition A.17 on page 334. 


The remaining exercises allow us a segue into §2.3, where we generalize the notion of a 
field discriminant given for quadratic fields in Definition 1.33 on page 46. 


2.29. 


If f(z) € Fla] where FC C is a field, deg(f) = d > 1, and 
d 
f(x) =a] [(@-4;),a; € F, 
j= 


then the discriminant of f is defined by 


disc(f) = a?¢-1 II (a; — a;)?, 


1<i<j<d 


where a; for 7 = 1,2,...,d are the roots of f in C. 


Prove that for an odd prime p and a primitive p-th root of unity 


disc(m,o)= [|] (-@)? =(-1)@-2p?-?. 


1<i<j<p-1 


(Hint: First prove that: mg,,9(x) = ae. x.) 


. Find the discriminant of the quadratic polynomial f given in Example 2.3 on the 


facing page by applying Exercise 2.29. Also, show that if m’ is the formal derivative, 
then 


dise(m, 9(z)) =—Ne(m, 9(a)). 


. Exercise 2.30 motivates the following more general result. Suppose that a € A and 


F = Q(a) is an algebraic number field of degree d over Q, and a = a1, Q2,...,@q are 
the conjugates of a over Q. Prove that 


d 
dise(ma,) = (-1)%*- 9? TT mi o(ag) = (-)%* PP Nw (mo(a5)), 
j=l 


where m{, g is the formal derivative of ma,g. 
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2.3. Integral Bases and Discriminants 


The mathematician is fascinated with the marvelous beauty of the forms he constructs, 
and in their beauty he finds everlasting truth. 
James Byrnie Shaw (1866-1948), mathematician/philosopher—see [63] 


Given a number field F’', we know from Theorem 1.24 on page 39 that there is an algebraic 
integer a such that F = Q(a). Moreover, every 6 € F' may be uniquely represented in the 


form 


B=q+quat-::+qa-107* € Qal, 


where d = |F : Q|. In other words, {1,a,07,...,v7~!} is a Q-basis for F. Moreover, since 
Op is Noetherian by Corollary 1.13 on page 37, then in particular, Dp is finitely generated 
as a Z-module so now we seek a Z-basis for it. 


Definition 2.5 — Integral Bases 


If Or is the ring of integers of a number field F’, a basis for Or over Z, or simply a Z-basis 
for Op, is called an integral basis for Op. 


Remark 2.2 By Exercise 2.32 on page 81, an integral basis for Or in the sense of Defi- 
nition 2.5 is a basis in the sense of Definition A.7 on page 324. 


Example 2.4 If F = Q(V2), then Or = Z[V/2], by Theorem 1.28 on page 45. Thus, 
B = {1, V2} is an integral basis for F. 


Example 2.5 If F = Q(/13), then by Theorem 1.28 
Op =Z[(1 + V13)/2] 4 Z[V13). 


Here a = (1+ V13)/2 is a root of ma.g(x) = 2? — x — 3, whereas 8B = V13 is a root of 
x? — 13. Thus, although {1, 3} is a basis for F consisting of algebraic integers, it is not an 
integral basis for F’. An integral basis for F' is {1, a}. 


The rings of integers in Examples 2.4-2.5 both have integral bases. Our immediate task is 
first to verify that any ring of integers Or of an algebraic number field F' has an integral 
basis. In order to do this, we first need the following notion. The reader should have famil- 
iarity with the basics of matrices and fundamental linear algebra as outlined in Appendix 


A. 

Definition 2.6 — Discriminant of a Basis 

Let F’' = Q(a) be an algebraic number field with |F' : Q| =d. If 
B= {aj,Q2,...,aa} 


is a Q-basis for F, and 0; (1 < j < d) are all of the embeddings of F in C, then the 
discriminant of the basis is given by 


disc(B) = det(0;(a;))’, 


where det denotes the determinant of the matrix with entry 0;(a;) in the i‘” row and j*” 
column. 
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In particular, if 
B = {1,a,...,a7 4}, 


then the determinant of the matrix (;(a’~')) is called the Vandermonde determinant and 


has value 
det(4;(a'-')) = T] (az—a4), (2.7) 
1<i<j<d 


by Exercise 2.33, where a, = 6;(a) is the k'” conjugate of a for k = 1,2,...,d. 
Example 2.6 In Example 2.4, B = {1, V2} is an integral basis for F, and 
,:V26 V2, and 0): V2+> —/2, 


are the embeddings of F in C. Thus, 


dise(B) = det(9;(a‘~"))? = det ( av wi?) : 


act ( i i, ) = (-2V2)? =8. 


Notice that in Example 2.6, disc(B) = disc(ma,g), where Ma,9(x) = x2? — 2—see Exer- 
cise 2.35 on page 82. This is an illustration of a more general phenomenon given as follows. 


Theorem 2.6 — Discriminants of Bases and Minimal Polynomials 
Let a € A and suppose that B = {1,a,a7,...,a41} is a basis for Q(a) over Q. Then 


disc(B) = disc(ma,g), 
where ™mq,9() is the minimal polynomial of a over Q. 


Proof. Let a1, a2,...,Q@a be the conjugates of a over Q. By (2.7), 


disc(B) = II (a; — a)’, 


1<i<j<d 


and by Exercise 2.29 on page 69, this is equal to disc(™ma,q). 


Now we demonstrate that the discriminants of two bases for a number field form a quotient 
that is a square of a nonzero rational number. 


Theorem 2.7 — Discriminants of Two Bases 


Let By = {a1,Q2,...,a} and By = {f1, B2,..., Ba} be two Q-bases for an algebraic number 
field F. Then 
disc(By) = D?disc(B1), 


where D = det(q,,i) € Q, D #0, and the g,,; € Q are determined by 


d 
Br= oan iei, (Gna € Q)- 
t=1 


Moreover, D € Z provided that B, is an integral basis and Bz € Op. 
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Proof. Let 0;, (1 < j < d) be the embeddings of F in C. The representations 8, = 
peor dki@i, imply that 
d 
0; (8%) = S- Gi; (Qu), 
i=1 
for each k = 1,2,...,d. Hence, we get a matrix equation: 


A1(B1) 92(81) +++ a(Br) 
A1(B2) 2(82) +++ Ga(B2) 


9:(82) 92(Ba) --- 4(Ba) 


Git Gide “oN id 6;(a1) 62(a1) an 6a(a1) 
G21 92,2 °** 2d 6; (a2) 62(a2) a 6a(az) 
dal Wd,2.°** ~~ dd Oi:(aa) O2(aa) ++: Oa(aa) 


By taking determinants, and squaring, we get the equation: 
disc(By) = D?disc(B1), 


with D = det(M), where 


M11 912 ‘"° Wid 

q2.1 922 ‘** 2d 
M= : : . . ) 

ddl Wd2 °° dd 


as required. 


Example 2.7 Let F = Q(vV13), a = (14+ V/13)/2, and 8 = V/13. In Example 2.5 on 
page 70, we saw that B, = {1,a} and By = {1,6} are bases for F, the former being 
integral, and the latter not integral, but merely a basis over Q. Since 


6, : V13 4 V13, and 6) : V13 4 —V13 


are the embeddings of F in C, then 
2 
ig _ (gi))2 — Ai(1) — A2(1) 
disc(Bz) = det(@;(6"))” = det ( 6,(V13)  6o(v13) 


=aet (5 : ) = (avi)? = 92, 


—-V13 
and ee ee ( 6; (1) 62(1) i 
1SC = de (a = de 
: : 0,(44¥8)  9,(14¥58) 
1 1 : 
= det ( 14/13 1-13 ) a (—V13)? = 13. 
2 2 
Thus, 


disc(Bz) = 2?disc(B,). 
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Here (4 
D=2=<det ( 12 ) 
since 
2 =1l=H1-01+q,202 =1-1+0-a, 
and 
fy = B= VIS = goa + dae =—1-142- 4B, 


We are now in a position to relate the notion of discriminant introduced in Definition 2.6 
on page 70 with the notions introduced in §2.2. See Exercise 2.1 on page 62 for a reminder 
of terminology and notions surrounding what follows. 


Theorem 2.8 — Discriminants as Traces 

If B = {a1,a2,..., Qa} is a Q-basis for an algebraic number field F' = Q(a), then 
A = disc(B) = det(Tr(aia;)) € Q, 

andA £0. Furthermore, if F’ is a totally real field, thenA > 0. 


Proof. Since A = disc(B) = det(6;(a;))?, then from the properties of determinants (see 
Theorem A.19 on page 336), we get: 


d 


det(0;(a;))? = det (>. tla) = det (Tr(aia;)), 


k=1 


so A = det(Tr(a;a;)). Therefore, by Corollary 2.2 on page 67,A € Q. It remains to show 
that A is nonzero and also positive when F' is totally real. 


Let B,; = B. By Theorem 1.24 on page 39, 
Bo = {1,a,07,...,a7 4} 


is a basis for F over Q. Thus, by Theorem 2.7, disc(B2) = D?disc(Bi), where D is given in 
that theorem. However, by Exercise 2.33 on page 81, 


dise(B2)= |] (a;-ai)’, (2.8) 


1<i<j<d 


and the a; are distinct so disc(B2) 4 0. Hence, disc(B,) 4 0. 
Since Bo is a basis for F over Q, then by Theorem 2.7, 


disc(B,) = d?disc(By). 


However, by (2.8), disc(Bz) is a square. Since disc(B,) 4 0, so given that F' is totally real, 
all of the a; are real, so disc(B1) > 0. 


Corollary 2.4 If B is a basis for F’ over Q with B C Or, then disc(B) € Z. 


Proof. This is immediate from Corollary 2.3 on page 67. 
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Example 2.8 Consider Example 2.7 again. F = Q(v13) is a totally real field with integral 
basis 


By = Ak (1 TF V'13)/2} = {1, a} = {a1, a2}, 
and a non-integral Q-basis 
Ba = {1, V13} = {1,8} = {f1, Ba}. 
Also, since the matrix 


(Te(aias)) = (Fh 


> 
Q 
y 


then 


Also, since we have the matrix 


dise(B1) = det(T'r(aja;)) = act ( ; ) ais, 


(Tr(B:8;)) = ( Tr(1) Tr(8) 


then 
disc(B2) =52= det (Tr (8; 8;)). 


Corollary 2.5 Let B; = {a1,a2,...,aa} be a Q-basis for an algebraic number field F’. If 
Bo = {31, B2,.--, Ba} Cc F and 


d 
Br = SS ak iM for qx; € F, and k = 1,2,...,d, 


i=l 
then Bg is also a basis for F if and only if det(q,;) 4 0. 


Proof. Suppose that det(q,,;) 4 0. It suffices to show that the 6; are linearly independent 
by Theorem A.4 on page 325. If 


d 
So 1be =9 (% €F), 
k=1 


then 
d 


d d d 
0= Se Vk y! dk iC = S- ay S- Vkdk,i- 
k=1 i=1 k=1 


i=1 


Since the a; are linearly independent, then 


d 
= Vkdk,i = 0. 
k=l 


Since det(q,i) #0, then y, = 0 for all k = 1,2,...,d. 
Conversely, if Bz is a basis for F’', then by Theorem 2.7 on page 71, 


disc(Bz) = D?disc(B,). 
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Hence, by Theorem 2.8 the result follows. 


In Example 2.5 on page 70, we saw that a Q-basis for an algebraic number field F’, consisting 
of algebraic integers, need not be an integral basis for Ff. The problem is that a basis 
consisting of algebraic integers may span F’ without spanning Dr as a Z-module. We now 
verify that every algebraic number field does indeed have an integral basis, and that the 
ring of integers is a free abelian group of rank equal to the degree of the number field over 


Q. 


Theorem 2.9 — Existence of Integral Bases 


Every algebraic number field F' of degree d over Q has an integral basis, and Op is a free 
abelian group of rank d. 


Proof. By Lemma 1.4 on page 38, there is a basis for F' consisting of elements from Of. 
This establishes existence of such bases. It remains to show that there exists such a basis 
that is a Z-basis for Dr. 


By Corollary 2.4, the discriminants of such bases are in Z, and by Theorem 2.8, they are 
nonzero. Hence, we may choose a basis 


Bi = {f1, Bo,..-, Ba} COr 


for F over Q such that |disc(B;)| is a minimum. Assume that B, is not a Z-basis for Op. 
Therefore, there exists a y € Or such that 


d 
Y= 546; (G €Q), 
j=l 


and at least one gq; ¢ Z. Without loss of generality, assume that q, ¢ Z. Thus, 
ga=lalt+r, (0<r<1) 
where |qi| is the floor of gj—see Page 8. Set 
6=7-|nlA = S u8s- La ese 
j=2 


The determinant of the matrix: 


Pr q2 dd 
0 1 0 
A= ; 
0 O 1 
is 
det(A) =r 40. 
By Corollary 2.5, 
Bo = {6,8 2,..-,Ba} 


is a basis for F' over Q. Since 
disc(Bz) = r?disc(B,), 


then 
|disc(Bz)| < |disc(B,)|, 
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contradicting the minimality of the discriminant of B;. Hence, By, is an integral basis for 
F.. Therefore, as a Z-module 
Or =Z6, ®---@ZBa, 


so Or is a free abelian group of rank d (see Equation (A.4) on page 325, and the discussion 
preceding it). 


Corollary 2.6 If B C Op is a Q-basis for F and disc(B) is squarefree, then B is an integral 
basis for F’. 


Proof. Let B = {f1,...,6a}. By Theorem 2.9, there exists an integral basis B, = 
{a,,...,a@qa} for F. By Theorem 2.7 on page 71, 


disc(B) = D?disc(B,), 
where D = det(qx,i), and qx; is given by 


d 
Be= >~qnioi (an, € Q). 
i=1 


Since disc(B) is squarefree, then D = +1. Therefore, (qx) € GLn(Z). Thus, by Exer- 
cise 2.34 on page 81, B is a Z-basis for Or. Thus, B is an integral basis for F’. 


Example 2.9 Example 2.5 on page 70 provides an example of a squarefree discriminant 
of an integral basis. However, in Example 2.4, B = {1, V2} is an integral basis for Q(V/2), 
but disc(B) = 8, so the converse of Corollary 2.6 fails to hold. 


Although Example 2.9 shows that the converse of Corollary 2.6 fails to hold, if we have two 
integral bases for an algebraic number field, then they must have the same discriminant. 


Corollary 2.7 Let B; and Beg be two integral bases for an algebraic number field F’. Then 
disc(B,) = disc(B2). 


Proof. By Theorem 2.7, 
disc(By) = Ddisc(B1) (2.9) 


where D € Z is given in that theorem. Thus, 
disc(B,) | dise(Bz) € Z, 

by Corollary 2.4 on page 73. By reversing the roles of By and Bo, we get 
disc(Bz) | dise(B,) € Z. 


Therefore, 


disc(B,) = +disc(Bo). 


However, by Equation (2.9), the minus sign is not possible. 
Corollary 2.7 essentially tells us that the discriminant of an integral basis for an algebraic 
number field is an invariant of the field, and it has a name. The following generalizes the 
notion for the quadratic case given in Definition 1.33 on page 46. 
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Definition 2.7 — Discriminant of a Field 


Let B be an integral basis for an algebraic number field F. Then the discriminant of F’ is 
disc(B), denoted byA pr. 


Application 2.1 — Quadratic Fields 
The ring of integers of a quadratic number field F is given by Dr = Z[wa,] where 


7 (1+ /Ap)/2 if A=1(mod 4), 
i a ifA » #1(mod 4) 


is called the principal surd. —see Application 1.2 on page 3 and Theorem 1.28 on page 45. 
Also, 


ip Ar ifA ~ =1(mod 4), 
P™) Dp/4 if A#1(mod 4) 


is called the radicand of F. 


Example 2.10 Let F = Q(v19). By Theorem 1.28, B = {1, V19} is an integral basis for 
F. Thus, 


it 1 
J19 —/19 


2 
Ap = dise() = det ( ) = (-2V19)? = 76 =4-19=4Dp. 
Example 2.11 Let F = Q(vV/13). Then 

B = {1, (1+ V13)/2} 
is an integral basis for F by Theorem 1.28. Thus, 


2 
1 1 

Ap = dise(B) = det ( uv 1-viB ) =13= Dr. 

2 


2 


Now we provide a generalization of the quadratic version promised in Remark 1.22 on 
page 46—see Biography 1.4 on page 54. 


Theorem 2.10 — Stickelberger’s Theorem 
If F is an algebraic number field, then 


Ar =0,1 (mod 4). 


Proof. Let B = {aj,...,@n} be an integral basis for F', where |F : Q| = n. For each 
i=1,2,...,n, let aj, a”), ae as” (not to be confused with the powers of a;) be all of the 


conjugates of a; over Q. By part (d) of Theorem A.19 on page 336 as 


V/Ap = det(a) 
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is essentially the sum of n! terms, each one corresponding to an element of the symmetric 
group S,, we may set o(a) € S, as the bijection assigning each a?) to an element of S;,. 


Therefore, we may refine this sum further in terms of the alternating group A, as follows. 
Vr = det(a”) = LS a) — ye a? =e-9, 
o(at? )EAn o(at? )€An 


so e,o € A. By Exercise 2.1 on page 62, we have that, for each embedding 0; of F in C, 
6;(e +0) =e-+0, and 6;(e0) = eo so, by Exercise 2.4, e+0,e0 € Q. Thus, by Corollary 1.11 
on page 37, e+0,e0 € Z. Therefore, 


Ap = (e— 0)? = (e+ 0)? — 4e0 = (e +0)? (mod 4), 


thenA - = 0,1(mod 4), as required. 
The above proof was published in 1929 by I. Schur (1875-1941), a student of G. Frobe- 
nius—see Biographies 2.3 on page 80 and 2.4 on page 81 . 


The next result tells us the effect on the discriminant of a field by the signature given in 
Exercise 2.11 on page 63. The following also generalizes the last statement of Theorem 2.8 
on page 73. This is a result of Kronecker—see Biography 2.2. 


Theorem 2.11 — Signatures and Discriminants 


If F is an algebraic number field with signature {r1,r2}, then the sign ofA p is (—1)". In 
other words,A - > 0 if and only if half the number of complex embeddings is even. 


Proof. Let B = {ay,...,Qn} be an integral basis for F’, where 
|F:Ql=n. 
(9) 


Since det(a;"") € C, we may write it as 


det(ad) =a+b/—1 (a,bER). 
Then det (a) = a—by/—1 ,where the & denotes the complex conjugate of x. Since complex 


conjugation will leave the real rows of the determinant unchanged, and will interchange 
the 2r2 “non-real” rows in pairs corresponding to the conjugate embeddings, the value of 


det (a!”) is also (—1)"?(a + bV—1). Therefore, 


(—1)"(a+bV/-1) =a —bV-1. 


If rg is even, then comparison of coefficients yields that b = 0, andA p = a? > 0. If ro is 
odd, then a = 0, so 
Ap = (bV—1)? = -0? < 0, 


as required. 


Example 2.12 If F = Q(\2), there are two complex embeddings, and one real embedding, 
namely r; = 1 = re, as seen in Exercise 2.12. Also, from Exercise 2.18 on page 68, it follows 
that 

Ap = —27-2? = —108 = (—1)"?108. 
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Biography 2.2 Leopold Kronecker (1823-1891) was born on December 7, 
1823 in Liegnitz, Prussia (now Legnica, Poland). In 1841, he entered the Uni- 
versity of Berlin and achieved a doctorate under Dirichlet’s supervision in 1845. 
Then he left for Silesia where he became wealthy in banking and real estate. He 
returned to Berlin in 1855 and remained there for the rest of his life. However, 
he did not become a professor there until 1883 when his lifelong friend Kummer 
retired. Kronecker was known as a finitist, believing that mathematics would 
be well-served by consideration of only finite numbers and a finite number of 
steps. This naturally brought him into conflict with the likes of Cantor. In 
fact, Kronecker was known for his vigorous personal attacks on anyone with 
whom he had mathematical disagreements. His contributions were mainly to 
algebraic number theory, the theory of algebraic equations, and elliptic func- 
tions. Along with Kummer and Dedekind, Kronecker is generally considered to 
be the third father of modern algebraic number theory. He died on December 
29, 1891 from bronchial illness. 


We conclude this section with an observation, which also serves as a caution, concerning 
integral bases. 


Remark 2.3 In view of Theorem 1.24 on page 39, the reader may be tempted into thinking 
that O 7 = Zia] where a € A for any number field F’. In other words, one might be lured into 
the belief that there is always an integral basis of the form {1,a,a?,...,a¢~!}. However, 
this is false, as the following illustration demonstrates. For criteria when this does happen 
see Exercise 5.48 on page 253. 


Example 2.13 The following was first presented in [44]. However, our proof is different and 
more detailed for the edification of the reader. Let K = Q(./—7, /—14), F = Q(V—14), and 
Or = Z|V—14]. We seek to establish that there is no 8 € Ox such that Ox = Z[A]. First, 
we show that there is no a € Ox such that Ox = Zia, /—14]. By way of contradiction, 
suppose there is such an a. Then, in particular, 


_14+V-7 
= 


A = 710+, where A € Dx,91,72 € OF 


and 

V—14//—7 = V2 = Ba + Bo where V2 € Ox, and fi, Bo € Or. 
Let 6 be the embedding of K in C given by 0: /—71) —V—7 and 0: V—14 V—I4. In 
other words, by Theorem 2.3 on page 59, (0) = Gal(K/F), fixing F' pointwise. Therefore, 


eS, 
-—— 


MAGA Sx/a7 =e 06). (2.10) 
6(V2) = -V2 = B0(a) + Bo, 


6(A) = 719(a) + 42, 


and 


V2 — (V2) = 2V2 = Brat Be — B10(a) — B2 = Bi(a— O(a). (2.11) 
Squaring (2.10)—(2.11) and taking norms from F: 


7 = Np("1)°Nr(a— 0(a))? and 2° = Np(6,)?Npr(a — 6(a))?. 
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It follows from Corollary 2.3 on page 67 that Ne(a — 6(a)) = £1 since Ne(a — O(a)) € Z 
and divides both 7? and 2°. Thus, Nr(y1) = +7. However, y, = a+ b\/—14 for some 
a,b € Z so a? + 14b? = +7 which is impossible. We have shown that there is no a € Ox 
such that Ox = Zla, /—14]. Now if there is a 6 € Ox such that Ox = Z(G], then by 
setting a = 8— /—-14 € Ox, we get Ox = Zila, /—14], which we have just shown to be 
impossible. 


Biography 2.3 Ferdinand Georg Frobenius (1849-1917) was born on October 
26, 1849 in Berlin-Charlottenburg, Prussia (now Germany), the son of a Protes- 
tant parson. He began his university studies at Gottingen for one semester, 
then returned to Berlin. At the University of Berlin, he was instructed by 
the likes of Kronecker, Kummer, and Weierstrass, the latter being his doctoral 
supervisor under whom he completed his dissertation in 1870. After some po- 
sitions at secondary school level, he was appointed to the University of Berlin 
as an extraordinary professor of mathematics in 1874. Note that Frobenius 
somehow bypassed the usual requirement for a Habilitation—see Footnote 1.1 
on page 23. The consensus is that this breach of usual strictness was due to 
Weierstrass’ influence. In 1875, after only a year at Berlin, Frobenius took a 
position as ordinary professor at the Eidgendssische Polytechnikum in Ziirich. 
Frobenius worked in Zurich for seventeen years where he married and raised 
a family. When Kronecker died in 1891, Weierstrass exerted further influence 
to have Frobenius fill the vacant chair at Berlin. For a quarter century, from 
1892, Frobenius was the leading influence in Berlin where he died on August 
3, 1917. Among his students were Edmund Landau, Robert Remak, and Issai 
Schur—see Biography 2.4. It is also noteworthy that Siegel was Frobenius’ 
student from 1915 until his death. 


Frobenius contributed to a vast array of mathematical areas, among them 
being analytic functions in series, linear differential equations, linear forms with 
integer coefficients, elliptic and Jacobi functions, biquadratic forms, and group 
theory, to name a very few. In group theory, he extended Sylow’s theorems from 
permutation groups to abstract groups, and provided a proof of the structure 
theorem for finitely generated abelian groups. But arguably his most influential 
contribution may have been in the area of group characters which he ultimately 
linked to representations and essentially gave birth to representation theory of 
groups. Indeed, in 1911 Burnside wrote up Frobenius’ character theory in his 
book Theory of Groups of Finite Order. Later, in other areas, such as quantum 
mechanics and theoretical physics, Frobenius’s group theoretic representations 
found new applications. 


Remark 2.4 Recall that Theorem 1.24 on page 39 is the primitive element theorem for 
algebraic number fields. In other words, any algebraic number field F' is generated over Q 
by a primitive element a € N. Therefore, Example 2.13 shows that there cannot exist a 
Primitive Element Theorem for rings of integers of algebraic number fields. Bases of the 
form {1,a,a7,...,a¢'} for a € A are called power integral bases, and Or = Za] is called 
monogenic. Hence, not all rings of integers of algebraic number fields have a power integral 
basis, namely they are not all monogenic. 
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Biography 2.4 Issai Schur (1875-1941) was born on January 10, 1875 in 
Mogilyov, in the province of the same name in what was then the Russian 
Empire, and is now Belarus. His university education began in Berlin 1894, 
where Frobenius was one of his teachers early on and ultimately his doctoral 
advisor—see Biography 2.3. By 1901 he had obtained his doctorate on a topic 
involving representations of the general linear group over C. His thesis intro- 
duced functions that we now call S-functions in honour of Schur’s contribution. 
He began his professional life as a lecturer at Berlin University in 1903, and was 
ultimately promoted to full professor in Berlin in 1919. He held this position 
until ousted by the Nazis in 1935. While at Berlin, he directed students in many 
disparate directions including combinatorics, matrix theory, and soluble groups. 
Among his doctoral students were both Richard and Alfred Brauer (brothers), 
Robert Frucht, Bernard Neumann, Richard Rado, and Helmut Wieland. After 
Schur was dismissed from his chair in 1935, he was also pressured to resign 
from the Prussian Academy in 1938. The academy had honoured him in 1922 
with his election to the august body. In 1939, he left for Palestine, broken by 
the stress and humiliation he suffered under persecution by the Nazis. Two 
years later, he died in Tel Aviv, Palestine (now Israel). 


Among Schur’s achievements was his discovery of what we now call the Schur 
multiplier. This proved to be well in advance of its time. Indeed, as evidence 
of this fact, some forty years later Eilenberg and MacLane defined cohomology 
groups, the second of which having coefficients in C — {0} is actually the Schur 
multiplier. However, Eilenberg and MacLane were unaware of this fact. Schur 
was interested in representation theory of groups, which began with his doctoral 
thesis and culminated many years later in his complete description of rational 
representations of the general linear group. He also worked on projective rep- 
resentations of groups and group characters. In this area he is known for what 
we now call Schur’s Lemma that says: If R and S are two finite-dimensional 
irreducible representations of a group G and ¢ is linear map from R to S' that 
commutes with the action of the group, then either ¢ is invertible, or ¢ = 0. 


His interests included Galois groups of certain classes of polynomials such as 
Hermite polynomials. He also worked in divergent series, function theory, in- 
tegral equations, and number theory. 


Exercises 


2.32. Prove that a Z-basis for Or in the sense of Definition 2.5 on page 70 is a basis 


sense of Definition A.7 on page 324. 


2.33. Let R be a commutative ring with identity and let a,,...,a@q © R. Prove that 


2.34. Let G be a free abelian group of rank n with basis {g1,... 


det(ai')= [J] (aj -a%). 


1<i<j<d 


A= (a5) € Mnxn(Z). Prove that the elements 


hi = 0 04,595 (i =1,2,...,n), 
j=l 


in the 


,9n}, and suppose that 


form a basis for G if and only if A € GL,(Z). (See Definition A.18 on page 337.) 
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2.35. 
2.36. 


2.37. 
2.38. 


2.39. 


2.40. 


2.41. 


2.42. 


2.43. 


ve 2.44. 


2. Field Extensions 


Using Exercise 2.31 on page 69, find disc(maq) when a = V2. 
Let F = Q(W5) = Q(a). Find Tr(a), Nr(a), and disc(ma,o), where mMa,9(2) = 
r* — 5. Also, show that disc(mo,9) = Nr(m/, 9(@)) 
Let F = Q(W5, G4), and a= W5. Find Tr(a) and Nr(a). 
Let B = {1,a,07,...,a¢ 1} be a basis for Q(a) over Q. Prove that 
disc(B) = disc(™mzq,9) 
where mq,9(2) is the minimal polynomial of a over Q. 


Let F be an algebraic number field with Or = Zla]. Prove thatA - = disc(ma,q) 
where m,,g is the minimal polynomial of a over Q. 


Let R be a Dedekind domain, and let J be an R-ideal with 


for distinct prime R-ideals P;. Prove that 
|R/T| = []1R/P,1%. 
j=l 


(Hint: Use Theorem 1.21 on page 32 and exercises in that section.) 


If R is a commutative ring, and M is an R-module with N an R-submodule of M, 
then N is called a pure submodule of M if NOrM=rWN for all r € R. Prove that if 
N is a direct summand of M, then N is a pure submodule. 


With reference to Exercise 2.41, prove that if O7 C Ox for algebraic number fields 
F CK, then Of is a pure Z-submodule of Ox. Conclude that any integral basis for 
F can be extended to an integral basis for K. 


Let F’ be a number field with basis {6,, 62,...,8,} over Q, and let a € Op be of 
degree d over Q. Suppose that 


0B: = >- 44,58; fori =1,2,...,n. 


j=l 

Prove that |Nr(a)| = | det(a;,;)]. 

Let F’ be an algebraic number field with a € Or, a #0. Prove that 
|De/(a)| = |Nr(o)I, 


where the vertical bars on the left denote the cardinality of the quotient group, con- 
sidered as free abelian groups, and the vertical bars on the right denote the absolute 
value of the norm. In particular, this says that if the right-hand side is 1, then as free 
abelian groups, Or = (a). 

(Hint: Show that the quotient of free abelian groups Dr/(a) is finite by demonstrating 
that Or and its subgroup (a) have the same rank. Then use Exercise 2.48.) 


(This exercise is a segue into §2.4, where we extend the notion of norm from elements 
to ideals and generalize the notion developed for the quadratic case in §1.7.) 
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2.4 Norms of Ideals 


The mathematician may be compared to a designer of garments, who is utterly obliv- 
tous of the creatures whom his garments may fit. To be sure, his art originated in the 
necessity for clothing such creatures, but this was long ago; to this day a shape will 
occasionally appear which will fit into the garment as if the garment had been made 
for it. Then there is no end of surprise and delight! 

from page 142 of The Two Realities in [63] 
Tobias Dantzig (1884-1956) Baltic, German, American mathematician 


Exercise 2.40 as well as Exercise 2.44 provide a lead-in to the following important notion 
which will allow us to refine some developments from earlier in the text and will lead us 
naturally to ideal classes and the class group. 


Definition 2.8 — Norms of Ideals 


Let F be a number field and let J be an (integral) 9 --ideal. Then we define the norm of I 
to be 
N(I) = |9r/T|, 


If J is a fractional ideal of Or then, by Remark 1.13 on page 26, there is a nonzero integral 
OD p-ideal J and an element a € Or such that 


Qo 
I 
Qi 
= 


Then the norm of J is given by 


where N(I) and N((a)) are the norms of the integral ideals J and (a). 


Notice that, via Exercise 2.40, we know that |D-/J| is finite. In fact, if 


rT] 
j=l 


via Theorem 1.17 on page 28, then Exercise 2.40 tells us that 
N(1) = |] 19r/P31”. 
j=l 


Since we have the prime power 
\Dr/P5| = py! 
by Exercise 2.49 on page 86, then 


N(1) = |] py”. 
j=l 


Also, by Exercise 2.47, for any nonzero fractional D -ideals J, J, 


N(Jd) = N(J)N(J). 
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Example 2.14 Let F = Q(V10), with the Dp-ideals P = (2, 10), 2 = (3,1+ V/10), and 
Q’ = (3,1 — V10). (Recall that Or = Z[V10] by Theorem 1.28 on page 45.) We will show 
that P, 2, and Q’ are prime D;-ideals, and compute their norms. Notice that by simply 
multiplying out the basis elements, 


QQ’ = (9,3(1 — V10),3(1 + V10)). 
However, 3 = 9 — (3(1 — V10) + 3(1 + V10)) € 99’, so (3) C 90’, and clearly the elements 
9,3(1 — V10),3(1 + V/10) are in the ideal (3), so 
(3) = 90, 


by Theorem 1.30 on page 49. Similarly, P? = (4,210, 10). However, 2 = 10—2-4 € P?, so 
(2) C P?, and certainly the elements 4,210, 10 are in the ideal (2), so again by Theorem 
1.30, 


Qe 04 
Hence, 
(6) = ??Q0', (2.12) 
- N(P?QQ') = N((6)) = 2? - 3? = 36 = Nr(6). (2.13) 


Notice that this coincides with the fact given in Exercise 2.44 on page 82 since 
|Dr/(6)| = |Or/(6)| = Nr (6), 


where the first quotient is that of a ring modulo an ideal, and the second quotient is as a 
free abelian group modulo a cyclic subgroup. We may also calculate |D r/P| by counting 
its elements. Although there are other means of doing this, we explore this avenue for its 
instructive and illustrative value. First, we observe that P is maximal, for if 


utvvV10 ¢ P = {2a + bV10 : a,b € Z}, 
then u € Z is odd and v € Z is arbitrary. Hence, we have the ideal equality, 
(P,u + vv10) = Z[V10], 
given that u-—1+ vV10 € P, so 
1=u—1+vV10— (ut vv10) € (P,u+vv10). 


By Theorem 1.10 on page 18, P is a prime 9 p-ideal. Thus, every element of Z[V/10] is either 
in P or is of the form 1 +a, where a € ?, so |Z[V10]/P| = 2 = N(P). A similar argument 
shows that every element of Z[V/10] is either in Q or is of one of the forms 3a+b—1+bv10 
or 3a+b—2+bvV10. Therefore, 


IZ[V10]/9| = 3 = N(Q) = N(Q’) = |Z[v'10]/9". 
Therefore, by Exercise 2.45 on page 86, Q is a prime 0 -ideal. Hence, 
N(PQ) = 6 = N(PO’), 
from which we could have deduced (2.13). 
Observe, as we did in Examples 1.9 and 1.11 on pages 4-5, that 
6 = (44+ V10)(4— V10) =2-3 


gives two distinct representations of the element 6 as a product of the irreducible elements 
4+ 10, 4—/10, 2, and 3. However, there is unique factorization of the ideals as given in 
(2.12). 
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The following employs Example 2.14 to illustrate Exercise 2.47 on the following page. 


Example 2.15 Let J = PQ where P and Q are given in Example 2.14. Via Exercise 2.51, 
I = PQ = (6,2 — v10), 


with 
po! = Pd’ = (6,2 + V10). 


Let J and J be fractional D-ideals given by 


= il, 


N(JJ) = N(J)N(4) = (a) ao) 


Then 


I 


(i) (3) = 


sO 


Jd = Z[V10] = Or. 
The following essentially generalizes Exercise 2.44 on page 82, illustrated in Example 2.14. 


Theorem 2.12 — Norms of Ideals and Discriminants 


Suppose that F’ is a number field, and that J is a nonzero integral Dp-ideal. Let B = 
{a1,...,Qn} be a Zbasis for I. Then 


N(D)? = me) 


Proof. Let By = {f1,..., Bn} be a Z-basis of Or. Then for each i = 1,...,n 


a; = eae (24,3 € Z). 
j=l 
By the same reasoning as in the solution, provided on page 378, of Exercise 2.43, 
N(I) = |Or/I| = | det(zi5)|. 
By Theorem 2.7 on page 71, 


disc(B) = (det (z;,;))?disc(B1) = N(1I)?Ap, 


as required. 


An immediate consequence, which is essentially Exercise 2.44, is the following. 


Corollary 2.8 If J is an integral O,-ideal with a € I, then N(I) = |Np(q)| if and only if 


I= (a). 
Example 2.16 If F = Q(/Af) and a = (a+ bVApr)/2 € Op, then 


a? — b2Ap 


N((a)) = |Ne(a)| = 


86 


2. Field Extensions 


Example 2.17 By Exercise 2.49, N(P) = p! for f € N, where ? is an integral prime OD p- 
ideal, then norms of prime ideals are not necessarily primes in Z—see Exercise 2.50. The 
exact nature of this power f will be settled when we discuss general ideal decomposition in 
number fields later in Chapter 5. 


Exercises 


2.45. 


2.46. 


2.47. 


2.48. 


2.49. 


2.50. 


2.51. 
2.52. 


2.53. 


2.54. 


Let F' be a number field and J a nonzero D -ideal. Prove that if N(J) is prime in Z, 
then I is prime in Dr. 


Let F' be a number field and J, J nonzero integral D r-ideals. Prove that 
N(IJ) = N(DN(J). 
Conclude that if an integral ideal J, divides an integral ideal Iz, then N(J1) | N(I2). 


Let F' be a number field and J,J nonzero fractional D p-ideals. Prove that 
N(Jd) = N(I)N(J). 


(Note that, unlike the conclusion in Exercise 2.46, we cannot conclude that N(J) 
divides N(Jg) in Z. Example 2.15 on the previous page provides a countererample to 
the contrary.) 


Let F be a number field and J a nonzero O--ideal. Prove that I | (N(Z)), namely 
that (N(J)) CI. 


Let F' be a number field and let P be a nonzero prime O-ideal. Prove that N(P) = 
p™, where PM Z = (p), for some m € N, where m < |F': QI. 


Suppose that @ € O- is a nonzero nonunit element for a number field F. Prove that if 
|Nr(a)| = p where p is a prime in Z then a is a prime in Dr. Show that the converse 
fails to hold. 


(Hint: Use Theorems 1.8 on page 16 and 1.30 on page 49 as well as Exercises 2.44 
on page 82 and 2.45 above in conjunction with Definition 2.8 on page 83.) 


(Note that this substantially generalizes Exercises 1.5 on page 6 and 1.22 on page 14 
and, in particular, shows that the assumption of UF'D in Exercise 1.22 is not necessary. 
We had to wait until we had the machinery made possible by our developments to this 
point before we could provide this result since it is quite difficult with only elementary 
techniques.) 


Find all ideals in Z[V10] having norm 6. 


Prove that for a Dedekind domain D, and an integral D-ideal I there are only finitely 
many integral D-ideals that divide I. 


Let F be a number field and n € N arbitrary but fixed. Prove that there exist only 
finitely many integral DO p-ideals I with N(I) =n. 


Let F be a number field and let J be an integral D-ideal. Suppose that n € N is the 
smallest positive integer in J. Prove that n | N(J). 


Chapter 3 


Class Groups 


Of all the ruins that of a noble mind is the most deplorable. 

spoken by Sherlock Holmes in His Last Bow (1917) from The Dying Detective. 
Sir Arthur Conan Doyle (1859-1930) 

Scottish-born writer of detective fiction 


In this chapter, we begin with the interplay between ideal and form class groups. This 
allows for a relatively simple proof of the finiteness of the class number in §3.2 for the 
quadratic case. This relatively easy approach is a segue into the general case involving the 
geometry of numbers in §3.3. Some of what follows is adapted from [54]. 


3.1 Binary Quadratic Forms 


Lagrange was the first to introduce the theory of quadratic forms—see Biography 3.3 on 
page 93. The theory was later expanded by Legendre, and greatly magnified even later by 
Gauss—see Biographies 3.1 on page 89 and 3.5 on page 95. An integral binary quadratic 
form is given by 

f(a,y) = ax? + bry + cy” with a,b,c € Z. (3.1) 


For simplicity, we may suppress the variables, and denote f by (a,b,c). The value a is 
called the leading coefficient, the value b is called the middle coefficient, and c is called the 
last coefficient. If gcd(a, b,c) = 1, then we say that f(z, y) is a primitive form. 

The aforementioned three great mathematicians looked at the representation problem: 
Given a binary quadratic form (3.1), which n € Z are represented by f(x,y)? In other 
words, for which n do there exist integers x, y such that f(x,y) =n? If gcd(a, y) = 1, then 
we say that n is properly represented by f(x,y). For instance, when studying criteria for 
the representation of a natural number n as sums of two squares, such as in [53, Section 6.1, 
pp. 243-251], a simple answer can be given. When looking at norm-forms 2? + ny? = m, 
where m,n € Z, such as in [53, Section 7.1, pp. 265-273], the problem can be given a 
relatively simple answer for certain m,n. In general, there is no simple complete answer. 
Moreover, an even more general and difficult problem arises, namely when can an integer 
be represented by a binary quadratic form from a given set of such forms? The theory of 
binary quadratic forms deals with this question via the following notion. In the balance of 
our discussion, we use the term form to mean binary quadratic form. 
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Definition 3.1 — Equivalent Binary Quadratic Forms 


Two forms f(a,y) and g(x,y) are said to be equivalent if there exist integers p,q,r,s, such 
that 


f(a,y) = g(pa + gy, rx + sy) and ps — qr = +1. (3.2) 


For simplicity, we may denote equivalence of f and g by f ~ g. If ps — qr = 1, then f and 
g are said to be properly equivalent, and if ps — gr = —1, they are said to be improperly 
equivalent. Two forms f and g are said to be in the same equivalence class or simply in the 
same class, if f is properly equivalent to g. 


Remark 3.1 From Definition 3.1, equivalent forms represent the same integers, and the 
same is true for proper representation — see Exercise 3.1 on page 94. Moreover, since 


ep ae eres 
aet(? 1) =ps qr=d 


@ : ) E GL(2,Z), 


— see Exercise 1.59 on page 54. Note, as well, that proper equivalence means that ps—qr = 1 


sO 
( ere ) € SL(2,Z), 
Tr § 


the subgroup of GL(2,Z) with elements having determinant 1. Properly equivalent forms 
are said to be related by a unimodular transformation, namely X = pxr+qy and Y = ra+sy 
with ps — gr = 1. Note as well, by Exercise 3.3 on page 94, proper equivalence of forms is 
an equivalence relation. 


TT 
— 


this means that 


The notion of proper and improper equivalence is due to Gauss. Lagrange initiated the 
idea of equivalence, although he did not use the term. He merely said that one could be 
“transformed into another of the same kind,” but did not make the distinction between 
the two kinds. Similarly Legendre did not recognize proper equivalence. However, there 
is a very nice relationship between proper representation and proper equivalence, since as 
Exercise 3.2 shows, the form f(x,y) properly represents n € Z if and only if f(«,y) is 
properly equivalent to the form nx? + bry + cy? for some b,c € Z. 


Example 3.1 For f(z,y) = 27+ 7y?, n = 29=147-2? = f(1,2), f(x,y) is properly 
equivalent to g(x,y) = 29x? + 86xry + 64y? since f(x,y) = g(3x — y,—-2x + y), where 
p = 3,q = -l,r = —2,s = 1. With reference to Remark 3.1, X = 3x—y, Y = -—24+y 
represents a unimodular transformation. 


The following notion is central to the discussion and links equivalent forms in another way. 


Definition 3.2 — Discriminants of Forms 


The discriminant of the form f(x,y) = ax? + bry + cy? is given by 
D=0? — 4ac. 


If D > 0, then f is called an indefinite form. If D < 0 and a < 0, then f is called a negative 
definite form, and if D <0 and a> 0, then f is called a positive definite form. 
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Remark 3.2 By Exercise 3.7 on page 94, if forms f and g have discriminants D and Dj, 
respectively, and f(x,y) = g(px + qy,rx + sy), then D = (ps — qr)?D,. Thus, equivalent 
forms have the same discriminant. However, forms with the same discriminant are not 
necessarily equivalent — see Exercise 3.8. Furthermore, if f(x,y) = ax? + bry + cy”, then 
by completing the square, we get 


daf (x,y) = (2ax + by)? — Dy’, 


so when D > 0, the form f(x,y) represents both positive and negative integers. This is the 
justification for calling such forms “indefinite.” If D < 0 and a < 0, then f(x,y) represents 
only negative integers, thus the reason they are called “negative definite,” and if a > 0, 
then they represent only positive integers, whence the term “positive definite.” Since we 
may change a negative definite form into a positive definite one by changing the signs of 
all the coefficients, it is sufficient to consider only positive definite forms when D < 0. We 
will, therefore, not consider negative definite forms in any discussion hereafter. 


Biography 3.1 Adrien-Marie Legendre (1752-1833) was born on September 
18, 1752, in Paris, France. He was educated at the College Mazarin in Paris. 
During the half decade 1775-1780, he taught along with Laplace (1749-1827) 
at Ecole Militaire. He also took a position at the Académie des Sciences, be- 
coming first adjoint in 1783, then associé in 1785, and his work finally resulted 
in his election to the Royal Society of London in 1787. In 1793, the Académie 
was closed due to the Revolution, but Legendre was able to publish his phe- 
nomenally successful book Eléments de Géométrie in 1794, which remained 
the leading introductory text in the subject for over a century. In 1795, the 
Académie was reopened as the Institut National des Sciences et des Arts and 
met in the Louvre until 1806. In 1808, Legendre published his second edi- 
tion of Théorie des Nombres, which included Gauss’s proof of the Quadratic 
Reciprocity Law. Legendre also published his three-volume work Ezercises 
du Calcul Intégral during 1811-1819. Then his three-volume work Traité des 
Fonctions Elliptiques was published during the period 1825-1832. Therein he 
introduced the name “Eulerian Integrals” for beta and gamma functions. This 
work also provided the fundamental analytic tools for mathematical physics, 
and today some of these tools bear his name, such as Legendre Functions. In 
1824, Legendre had refused to vote for the government’s candidate for the In- 
stitute National, and for taking this position his pension was terminated. He 
died in poverty on January 10, 1833, in Paris. 


Congruence properties of the discriminant of a form may provide us with information on 
representation. For instance, Exercise 3.9 tells us that congruence properties modulo 4 
determine when an integer may be represented by forms with discriminant D = 0, 1(mod 4). 
Furthermore, this means that we can take the equation D = b? — 4ac and let a = 1 and 
b = 0 or 1 according to whether D = 0 or 1(mod 4), so then c = —D/4 or —(D — 1)/4, 
respectively. Thus, we get a distinguished form of discriminant D given as follows. 


Definition 3.3. — Principal Forms 
If D = 0,1(mod 4), then (1,0, —D/4) or (1,1, —(D-—1)/4), respectively, are called principal 
forms of discriminant D. 


90 3. Class Groups 


Remark 3.3 Via Exercise 3.10 on page 94, we see that if D = —4m, we get the form 
f(x,y) =2?+my?. As we shall see, these forms are particularly important in the historical 
development of the representation problem. Indeed, entire books, such as [15] are devoted 
to discussing this issue. There is a general notion that allows us to look at canonical forms 
for more illumination of the topic. This is given in the following, which is due to Lagrange. 


Definition 3.4 — Reduced Forms 


A primitive form f(x,y) = ax? + bry + cy’, of discriminant D, is said to be reduced if the 
following hold. 


(a) When D <0 anda>0, 


|b] <a<c, and if either |b] = a or a=c, then b> 0. (3.3) 


(b) When D > 0, 
0<b<VD and VD—b <2lal < VD +b. (3.4) 


Note that since f is positive definite in part (a) of Definition 3.4, then by Definition 3.2 on 
page 88, both a and © are positive. 


With the notion of reduction in hand, we have the following result, which provides us with 
a unique canonical representative for equivalence classes of positive definite forms. 


Theorem 3.1 — Positive Definite and Reduced Forms 


Every positive definite form is properly equivalent to a unique reduced form. 


Proof. Let f(x,y) = ax? +bry+cy? be a primitive positive definite form. Let n be the least 
positive integer represented by f. By Exercise 3.2, there exist B,C’ € Z such that f ~ g 
properly, where g(X,Y) = nX2+ BXY +CY*. For any integer z, the transformation 
X=x-zy,Y =y yields 


g(X,Y) = nx? + (B- 2nz)ry + (nz* — Bz + C)y’. 


If we set z = Ne(#), the nearest integer to B/(2n), then 


< 
5° 9g ee 


n< B-2nz <n, and |B—-2nz| <n. 
Thus, if we set b} = B—2nz and c; = nz? — Bz + C, then 
GX, Y) = na + bry + cy’, 


where |b;| <n. Thus, f is properly equivalent to g, g is positive definite, and g(0,1) = ci. 
Therefore, g represents c,, which implies c; € N, and c, > n by the minimality of n. We 
have shown that f is properly equivalent to a reduced form. The balance of the result will 
follow from the next result. 


Claim 3.1 Any two properly equivalent reduced forms must be identical. 


Suppose that the form f(x,y) = ax? + bry + cy” is reduced and properly equivalent to the 
reduced form g(x,y) = Av? + Bry + Cy? via the transformation 


g(x,y) = f(px + gy, rx + sy) 
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with ps — qr = 1. We may assume without loss of generality that a > A. Also, a straight- 
forward calculation shows that 


A= ap’ + bpr + cr’, 


B = 2apq + b(ps + qr) + 2crs, (3.5) 
C = aq? + bgs + cs”. 
Furthermore, we have 
lb] <a<e. (3.6) 
Using (3.6) we get, 
A= ap? + bpr + cr? > ap? — |bpr| + er? > ap? — |bpr| + ar? = a(p? +r?) — |bpr|. (3.7) 
However, since 
p’ +r? > Qlpr|, (3.8) 


then (3.7) is greater than or equal to 2alpr| — |bpr| > alpr|, where the latter inequality 
follows from (3.6) again. We have shown that 


A> alpr|. (3.9) 
However, by assumption a > A, so |pr| < 1. If |pr| = 0, then 
A = ap* + bpr + cr? > ap* +. ar? = a(p? + r7) >a, 


so A =a. On the other hand, if |pr| = 1, then by (3.9), A > a, so again we get A= a. 
It remains to show that B = b since, once shown, it follows from Exercise 3.7 on page 94 
that C = c, since B? — 4AC = b? — 4ac. 

Suppose that c > C. Then c > asince C > A =a. If |pr| = 1, then by (3.6)—(3.8), using the 
fact that cr? > ar?, we deduce A > a, a contradiction. Hence, |pr| = 0. If p = 0, then using 
(3.7)-(3.8), we conclude that A > a, so r = 0. Since ps — gr = 1, then ps = 1. Moreover, 
since |B| < A = a given that g is reduced, then from (3.6), we get —a < |B| — |b] < a. 
However, by (3.5), B = 2apq + 0. It follows that ¢g=0 and B=b. 

Lastly, suppose that c < C. By solving for a,b,c in terms of A,B,C we may reverse the 
roles of the variables and argue as above to the same conclusion that B = b. This completes 
the proof. 


Remark 3.4 The above says that there is a unique representative for each equivalence 
class of positive definite binary quadratic forms. Furthermore, by Exercise 3.11 on page 95, 
when D < 0, the number hp of classes of primitive positive definite forms of discriminant 
D is finite, and hp is equal to the number of reduced primitive forms of discriminant D. 
(Note that we prove hp < oo in general for field discriminants in Theorem 3.7 on page 106.) 


The case for indefinite forms is not so straightforward. The uniqueness issue, in particular, 
is complicated since we may have many reduced forms equivalent to one another, and the 
determination as to which reduced forms are equivalent is more difficult. Yet, we resolve 
this issue in Theorem 3.5 on page 101. 


We conclude this section with a result due to Landau. This result precisely delineates the 
negative discriminants D = —4n for which hp = 1 and the proof is essentially that of 
Landau [35]. 


92 3. Class Groups 


Biography 3.2 Edmund Landau (1877-1938) was born in Berlin, Germany 
on February 14, 1877. He studied mathematics at the University of Berlin, 
where his doctoral thesis, awarded in 1899, was supervised by Frobenius—see 
Biography 2.3 on page 80. Landau taught at the University of Berlin for the 
decade 1899-1909. In 1909, when he was appointed as ordinary professor at 
the University of Gottingen, he had amassed nearly seventy publications. His 
appointment at Gottingen was as a successor to Minkowski. Hilbert and Klein 
were also colleagues there—see Biography 3.4 on page 94. He became full 
professor there until the Nazis forced him out in 1933. On November 19, 1933, 
he was given permission to work at Groningen, Netherlands, where he remained 
until he retired on February 7, 1934. He returned to Berlin where he died of a 
heart attack on February 19, 1938. 


Landau’s major contributions were in analytic number theory and the dis- 
tribution of primes. For instance, his proof of the prime number theorem, 
published in 1903, was much more elementary than those given by Poussin 
and Hadamard—see [53, §1.9, pp. 65-72] for a detailed overview. He estab- 
lished more than 250 publications in number theory and wrote several books 
on number theory, which were influential. 


Theorem 3.2 — When h_,, =1 forn>0 
Ifn EN, then h_4, = 1 if and only if n € {1, 2,3, 4, 7}. 


Proof. Suppose that han =1. f(x,y) = x? + ny? is clearly reduced since a = 1, b = 0, and 
c= n> 1 in Definition 3.4 on page 90. The result is clear for n = 1, so we assume that 
n>. 


Case 3.1 n is not a prime power. 


There exists a prime p | n such that p%||n, for d € N, where || denotes proper division, 
also commonly called exactly divides, namely p4 | n, but p’+! + n — see [53, Definition 
1.3, p. 16] for the general notion. Let a = min(p%,n/p“) and c = max(p%,n/p“). Thus, 
gcd(a,c) = 1, where 1 < a <c, since n is not a prime power. Thus, g(x, y) = ax? + cy? isa 
reduced form of discriminant —4ac = —4n, so h_4n, > 1, given that f(z, y) is also a reduced 
form of discriminant D, unequal to g(x,y). This completes Case 3.1. 


Case 3.2 n = 2° where (EN. 


We need to show that h_4, > 1 for @ > 3. If € = 3, then D = —32 and the form 
g(x,y) = 327 + 2xy + 3y? is a reduced form of discriminant 2? — 4-3-3 = —32 not equal 
to f(x,y), so we may assume that @ > 4. Set 


g(@,y) = 4a? + day + (2°? + 1)y?, 


which is primitive since gcd(4, 4, 2°~? + 1) = 1, and reduced since 4 < 2’? + 1. Moreover, 
the discriminant is 


D=4?—4.4. (2°? 41) =-16- 2%? = — 2°? = —4n, 
but g 4 f. This completes Case 3.2. 


Case 3.3 n= p* where p > 2 is prime and k €N. 
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Suppose that n+ 1 is not a prime power. Then, as in Case 3.1, we may write n + 1 = ac, 
where 1 < a < cand gcd(a,c) = 1. Thus, 


g(x,y) = ax? + 2ey + cy? 


is a reduced form of discriminant 2? — 4ac = 4— 4(n+ 1) = —4n, and f #49, s0 h_4n > 1. 


Lastly, suppose that n +1 = 2° where t € N, observing that n +1 = p* +1 is even. If t > 6, 
then 
g(x,y) = 8x? + bry + (2°? + 1)y? 


is reduced since 8 < 2'~3 + 1, and gced(8, 6, 24-3 + 1) = 1. Also, g has discriminant 
D=6? —4.8(2'-3 41) =4-—4.2' =4—4(n+1) =—4n, 


and f 4g, so h_4a, > 1. For t < 5 we have that t € {1,2,3,4,5} have the corresponding 
values 
n € {1,3,7, 15,31}. 
It remains to exclude n = 15,31. 
If n = 15, then n is not a prime power so this violates the hypothesis of Case 3.3. If n = 31, 
then the form 
g(x,y) = 5x” + Ary + Ty? 


is reduced since b = 4 << a =5 < c=7, and is primitive since gcd(a, b,c) = 1. Lastly, the 
discriminant is 


D=4—4-5-7=—4-31. 
This completes Case 3.3, and we are done for this direction of the proof. 


For 
n € {1,2,3,4,7} 


we get that h_4,, = 1 from Exercise 3.13. 


Biography 3.3 Joseph-Louis Lagrange (1736-1813) was born on January 25, 
1736 in Turin, Sardinia-Piedmont (now Italy). Although Lagrange’s primary 
interests as a young student were in classical studies, his reading of an essay by 
Edmund Halley (1656-1743) on calculus converted him to mathematics. While 
still in his teens, Lagrange became a professor at the Royal Artillery School in 
Turin in 1755. Lagrange sent Euler some of his work, including methods in the 
calculus of variations, then called isoperimetrical problems. This helped Euler 
to solve a problem upon which he had been working for years. Ultimately, 
Lagrange succeeded Euler as director of mathematics at the Berlin Academy of 
Science in 1766. Most of his time at Berlin was spent on celestial mechanics and 
the polishing of his masterpiece Mécanique Analytique or Analytical Mechanics, 
which was published in Paris in 1788. In this work, he spoke of the science 
of mechanics as the geometry of four dimensions, three dimensional physical 
space and one time coordinate. This was exploited by Einstein in 1915, when 
he developed his general theory of relativity. Lagrange left Berlin in 1787 to 
become a member of the Paris Academy of Science where he remained for the 
rest of his professional life. When he was fifty-six, he married a young woman, 
almost forty years younger than he, the daughter of the astronomer Lemonnier. 
She became his devoted companion until his death in the early morning of April 
10, 1813 in Paris. 
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Exercises 


3.1. 


Done 


3.3. 


3.4. 
3.5. 
3.6. 


3.7. 


3.8. 
3.9. 


3.10. 


Prove that equivalent forms represent the same integers, and the same is true for 
proper representation. 


Prove that the form f(x,y) properly represents n if and only if f(x,y) is properly 
equivalent to the form nx? + Bry + Cy? for some B,C € Z. 


Prove that proper equivalence of forms is an equivalence relation, namely that the 
properties of reflexivity, symmetry, and transitivity are satisfied—see Exercise 1.8 on 
page 6. 


Biography 3.4 David Hilbert (1862-1943) was born in K6nigsberg, Prussia, 
which is now Kaliningrad, Russia. He studied at the University of Konigsberg 
where he received his doctorate under the supervision of Lindemann. He was 
employed at Konigsberg from 1886 to 1895. In 1895, he was appointed to fill 
the chair of mathematics at the University of Gottingen, where he remained for 
the rest of his life. Hilbert was very eminent in the mathematical world after 
1900 and it may be argued that his work was a major influence throughout 
the twentieth century. In 1900, at the Paris meeting of the Second Interna- 
tional Congress of Mathematicians, he delivered his now-famous lecture The 
Problems of Mathematics, which outlined twenty-three problems that continue 
to challenge mathematicians today. Among these were Goldbach’s conjecture 
and the Riemann hypothesis. Some of the Hilbert problems have been resolved 
and some have not, such as the two listed. Hilbert made contributions to many 
branches of mathematics including algebraic number theory, the calculus of 
variations, functional analysis, integral equations, invariant theory, and math- 
ematical physics. Hilbert retired in 1930 at which time the city of Konigsberg 
made him an honorary citizen. He died on February 14, 1943 in Gottingen. 


Prove that improper equivalence is not an equivalence relation. 


Prove that any form equivalent to a primitive form must itself be primitive. 


Prove that if f represents n € Z, then there exists a g € N such that n = g?n, and f 
properly represents 74. 


Suppose that f ~ g where f is a form of discriminant D and g is a form of discriminant 
D,, then D = (ps — qr)?D, = D, where f(x,y) = g(px + qy, ra + sy). 


Provide an example of forms with the same discriminant that are not equivalent. 


Let D = 0,1(mod 4) and let n be an integer relatively prime to D. Prove that if n 
is properly represented by a primitive form of discriminant D, then D is a quadratic 
residue modulo |n|, and if n is even, then D = 1(mod 8). Conversely, if n is odd and 
D is a quadratic residue modulo |n|, or n is even and D is a quadratic residue modulo 
4|n|, then n € Z is properly represented by a primitive form of discriminant D. 


Let n € Z and p > 2 be a prime not dividing n. Prove that p is represented by 
a primitive form of discriminant —4n if and only if the Legendre symbol equality 
(—n/p) = 1 holds. 


(Hint: Use Exercise 3.9.) 


3.1. 


3.11. 


3.12. 
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For a fixed integer D < 0, let hp be the number of classes of primitive positive definite 
forms of discriminant D. Prove that hp is finite and is equal to the number of reduced 
forms of discriminant D. 


Let n € N and p > 2 be prime with p{n. Prove that the Legendre symbol (—n/p) = 1 
if and only if p is represented by one of the h_4, reduced forms of discriminant —4n. 


(Hint: See Exercises 3.10-3.11 and Theorem 8.1 on page 90.) 


. Prove that ifn € {1,2,3,4,7}, then h_4, = 1. 


Biography 3.5 Carl Friederich Gauss (1777-1855) is considered to be among 
the greatest mathematicians who ever lived. His genius was evident at the 
age of three, when he corrected an error in his father’s bookkeeping. Also, 
at the age of eight, he astonished his teacher, Bittner, by rapidly adding the 
integers from 1 to 100 via the observation that the fifty pairs (j +1, 100-7) for 
j =0,1,...,49 each sum to 101 for a total of 5050. By the age of fifteen, Gauss 
entered Brunswick Collegium Carolinum funded by the Duke of Brunswick 
to whom Gauss dedicated his masterpiece Disquisitiones Arithmeticae [20], 
published in 1801. In 1795, Gauss entered Gottingen University, and by the age 
of twenty achieved his doctorate, which contained the Fundamental Theorem 
of Algebra—see Theorem A.18 on page 334. His intimate friend as a student 
was Farkas (or Wolfgang) Bolyai (1775-1856). Both had tried to prove Euclid’s 
parallel postulate, which is equivalent to the assumption that two converging 
lines must intersect. Although Bolyai gave up in frustration, Gauss had some 
ideas which, had he developed, would probably have led to his being credited 
with the discovery of non-Euclidean geometry, but the honour went to others. 
Gauss did publish his classic treatise Disquisitiones circa superficies curvas 
in 1827, which may be said to have initiated differential geometry. Gauss 
is credited with having invented two physical objects. One is the heliotrope, 
which worked by reflecting the sun’s rays using a small telescope and an array 
of mirrors. The other, in collaboration with Wilhelm Weber (1804-1891), was 
the invention of the first operational telegraph. 


He is also credited with computing, from some severely limited data, the orbit 
of Ceres Ferdinandea, discovered on January 1, 1801 by Piazzi, an Italian as- 
tronomer. Ceres was rediscovered by Zach, an astronomer and friend of Gauss, 
in June 1801, upon its reappearance from behind the sun, where Piazzi had 
lost his observation, leading to his small amount of data. Ceres was in virtu- 
ally the exact position where Gauss had predicted! Although Gauss did not 
disclose it at the time, he used his method of least squares approximation to 
do the calculation. Indeed, some contend that this calculation is what made 
Gauss famous—see the MAA award-winning article [67] by Teets and White- 
head. However, in total, Gauss’ accomplishments are too vast to discuss here 
in detail. 


Gauss was married twice. He married his first wife, Johanna Ostoff on October 
9, 1805. She died in 1809 after giving birth to their second son. His second 
wife was Johanna’s best friend Minna, whom he married in 1810. She bore 
him three children. Gauss remained a professor at G6ttingen until the early 
morning of February 23, 1855 when he died in his sleep. 
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3.2. Forms and Ideals 


Happiness is not an ideal of reason but of imagination. 


from section two of 

Fundamental Principles of the Metaphysics of Ethics (1785) 
Immanuel Kant (1724-1804) 

German philosopher 


We study how to “multiply” forms, which is called “composition of forms” and relate it 
to ideal multiplication, which allows us to prove the finiteness of class numbers, for the 
quadratic case, in a relatively easy fashion. Also, we intimately link the class group of 
forms with that of ideals. The quadratic case is made transparent via binary quadratic 
forms, whereas the general case requires Minkowski’s geometry of numbers in §3.3. Therein 
we prove the general case of finiteness of the ideal class number, motivated by the quadratic 
case—see Biography 3.6 on page 107. 

First we need to develop some new notions. The first result allows us to select a canon- 
ical form in each equivalence class. For ease of elucidation, we restrict our attention to 
discriminants that are field discriminants—see Definition 1.33 on page 46. 


Lemma 3.1 — Canonical Forms 


Let F = Q(VAF) be a quadratic field of discriminantA p and let m € Z. Then every 
proper equivalence class of forms of discriminantA - contains a primitive form with positive 
leading coefficient that is relatively prime to m. 


Proof. Let f = (a,b,c) € Ca, and set 
Pa anice = [[- 
p 
where the product ranges over all distinct primes p such that p | a, Pp | c and p | m. Also 
set 
Pam = II qd 
q 


where the product ranges over all distinct primes qg such that q | a, q | m, but q fc, set 


where the product ranges over all distinct primes r such that r | c,7r | m, but rf a, and set 
Sm = II 8 
s 


where the product ranges over all distinct primes s such that s | m but $f Pam,ePa,mPem:- 
Then f represents 
GPs + bPamPemSm + OPiS)" =N. (3.10) 


Claim 3.2 gcd(N,m) = 1. 


3.2. Forms and Ideals 97 


Assume that a prime t | N and t | m. Assume first that t | a. Then 
t | Pa,m,ePa,m 

by the definition of the latter. If t | Pam, then by (3.10), 
t | CReyiOnt 


However, t { PemSm, so t | c. This contradicts the fact that t | Pam. Hence, t { Pa,m, 80 
t| Pawm,ec- It follows from (3.10) that 


A aeredacrmcren 


However, we have already shown that t { Pa,m and since t | a, then t Pom. Also, t | Pais 
so t { S;,, which implies that t | b. We have shown that t | gcd(a, b,c), contradicting that 
f is primitive. Hence, our initial assumption was false, namely, we have shown that t { a. 
Therefore, 

| ere 


by the definition of the latter. However, by (3.10), this implies that t | aP,,m, a contradiction 
to what we have already shown. This secures the claim. 


By Exercise 3.2 on page 94, Claim 3.2 tells us that f is properly equivalent to the form 
g(x,y) = Nx? + Bry + Cy’ 
for some B,C € Z. If N > 0, then we have our result. 
If N <0, then by setting 79 = Bmé£+1 and yo = —2Ném for some ¢ € Z, 
g(x0; yo) = Nap + Broyo + Cyo 
= N(Bmé +1)? + B(Bmé + 1)(—2N€m) + C(2NEmy? 
= NB?m?0? +2NBml + N —2NB?m?0 — 2NBlm+ 4CN? Cm? 
= N(1— m?0?(B? —4NC)) = N(1—-m’?@Ar) =Q, 
where @ > 0 if N <0. 


Since f represents 
Q=N(1—-mCWAF) 


and Q is relatively prime to m, given that N and 1 — m?é?Ap are relatively prime to m, 
then Exercise 3.2 gives us the complete result. 


Now we make the connection with ideals. 

Theorem 3.3. — Ideals and Composition of Forms 

Suppose that 0 is the ring of integers of a quadratic field of discriminantA ¢ and 
f(z,y) = ax? + bry + cy? 


is a primitive form, with a > 0, of discriminantA - = b? — 4ac. Then 


I = (a,(-b+ V/Ar)/2) 


is an 9 ;-ideal. 
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Proof. Since Ap = b? — 4ac, then b? = Ap (mod 4a), so by Exercise 1.58 on page 54, I is 
an 0 p-ideal. 


Note that in Theorem 3.3, we must exclude the case a < 0 since the norm of an ideal 
must be positive. This excludes the negative definite case, but in view of Remark 3.2 on 
page 89, there is no loss of generality. Moreover, in the indefinite case, with a < 0, we 
may circumvent this via the techniques given in the proof of Theorem 3.5 on page 101. In 
particular, see (3.14) on page 103. 


Now we examine a means of associating forms in a unique way that allows us to “compose” 
them. 


Definition 3.5 — United Forms 


Two primitive forms f = (a1, b1,¢1) and g = (ag, b2, cz) of discriminant D are called united 
if gcd(ar, a2, (by + bz) /2) =ch. 


Note that in Definition 3.5, since b} — 4a,c,; = b3 — 4agc2, then b; and by have the same 
parity so (b) + b2)/2 € Z. 


Theorem 3.4 — United Forms and Uniqueness 


If f = (a1, 61, c,) and g = (ag, bz, cp) are united forms of discriminant D, where D is a field 
discriminant, then there exists a unique integer b3 modulo 2a,a2 such that 


b3 — b; (mod 20;), j = 1,2 


and 
b3 = D (mod 4ayaz). 


Proof. This is an immediate consequence of the multiplication formulas for quadratic ideals 
on page 48. 


Now we are in a position to show how to multiply or compose forms. 


Definition 3.6 — Dirichlet Composition?! 


Suppose that f = (a1, b1,¢,) and g = (dg, be, cz) are primitive, united forms of discriminant 
Ar whereA = is a field discriminant, a3 = a,qa2, 63 is the value given in Theorem 3.4, and 
b3 — Ap 

4a3 . 


i 


Then the Dirichlet composition of f and g is the form 


fog =G= (a3, b3, €3). 


3-1 As a point of interest, there is a recent paper—see [4]—that shows how composition of binary quadratic 
forms leads to parametrizations of cubic, quartic, and quintic number fields. These, in turn, are shown to 
lead to formulas for counting the number of quartic and quintic number fields of bounded discriminant, as 
well as yet-to-be-determined connections with exceptional Lie groups and higher rank division algebras, for 
instance. 
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Remark 3.5 Note that 
(az, (bs + V/Ar)/2) 

is an-O -ideal where F = Q(AF) by the multiplication formulas given on page 48. This 
shows the intimate connection between multiplication of quadratic ideals and composition 
of forms. Indeed, we need not restrict to field discriminants for this to work. We could 
expand the discussion to non-mazimal orders in quadratic fields but then the delineation 
becomes more complicated, since we must rely on special conditions for invertibility of ideals 
and other considerations, all of which are satisfied in the so-called maximal order Dp. See 
[49] for the more general approach. 


The form G, in Definition 3.6, is a form of discriminant 
b3 me 4a3cg3 = b3 = 4a3(b3 = Ar)/(4az3) = b3 = b3 + Ar _ Ar. 


Also it is primitive since if a prime p | gcd(az, b3,c3), then p | a, or p | a2. Without loss of 
generality suppose it divides a,. Then since p | bs, we must have that p | by since b3 = by 
(mod 2a;) by Theorem 3.4. However, since p | c3 and b} — 4a3c3 = D, then p? | Ap. 
However,A - is a field discriminant so p = 2 andA p = 0(mod 4) is the only possibility. By 
Definition 1.33 on page 46,A -/4 = 2,3(mod 4). IfA -/4 = 2(mod 4), then by Theorem 


3.4, b3/2 is even since 
2 
(2) = (mod ayaz), 


given that 2 | a1. However, we have 
bs \* A 
(2) — agc3 = aa (3.11) 
so since 2 | a3 and 2 | c3, then Ar /4 = 0(mod 4), a contradiction. Thus, 
Ar/4=3 (mod 4), 


so by (3.11), b3/2 is odd. However, (3.11) impliesA -/4 = 1(mod 4), a contradiction. We 
have shown that, indeed, G is a primitive form of discriminantA -. 


Remark 3.6 The opposite of 
f = (4,8, ¢) 


‘i = (a, —b, e)s 
which is the inverse of f under Dirichlet composition. To see this we note that under the 
proper equivalence that sends (x,y) to (—y,x), f~' ~ (¢,b,a), for which ged(a,c,b) = 1. 
This allows us to choose a united form in the class of f~! by Definition 3.5, so we may 
perform Dirichlet composition to get 
b? — Ar 

4dac 


fof *=G=S («« b, ) = (ac, b, 1). 


Moreover, by Exercise 3.19 on page 107, 
G~ (1,0, 47/4) whenA - =0 (mod 4) 


and 
G ~ (1,1,(1-— Ar)/4) whenA - =1 (mod 4). 


Thus, G is in the principal class by Corollary 3.1 on page 103. 
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We now need to introduce the ideal class group as a vehicle for defining the form class group 
since Theorem 3.3 on page 97 gives us the connection. 


Definition 3.7 — Equivalence of Ideals 


Let Or be the ring of integers of a number field F’. Then two OD p-ideals I, J are said to 
be in the same equivalence class if there exist nonzero a,0 € Or such that (a)I = (8)J 
denoted by I~ J. 


Remark 3.7 By Theorem 1.26 on page 42 and Exercise 1.42 on page 33, we know that 
the set of all fractional Op-ideals forms a multiplicative abelian group. If we denote this 
group by J,,, and let Pa, denote the group of principal fractional ideals, then the quotient 
group 


Ta, 
=C 
Pea, 8 
is called the class group of Or, and 
hop ax Corl, 


is the ordinary or wide class number, which we will show to be finite. (First, we show 
finiteness in the (easier) quadratic case below—see Corollary 3.4 on page 106—then develop 
the geometry of numbers for the general case—see Theorem 3.11 on page 116.) Also, the 
class of an Oy-ideal I is denoted by I. Thus a product of classes IJ = C is the class 
belonging to any ideal C = IJ formed by multiplying representatives J € I and J © J. 
The identity element 1 is the principal class, namely all principal ideals (a) ~ (1), meaning 
(a) € 1. The existence of inverse classes I~* for any class I is guaranteed by Exercise 1.43 
and Theorem 1.26, namely II~? = 1. The commutative and multiplicative laws are clear, 
namely 
IJ = JI, and I(JK) = (IJ)K, for O,-ideals I, J, K. 


Also the (integral) prime ideals are the generators of the class group. To see this let J be a 
fractional 9 p-ideal and let a € Or be a nonzero element such that al C Op. Then aJ is 
an integral 9 --ideal and 


(aDpr)'(aJ) =I = II Pa, 
j=l 


where the a; € Z are not necessarily positive and the P; are distinct prime 0 -ideals as 
determined by Theorem 1.17 on page 28. 


Note as well, that the conjugate ideal I’ for I, first mentioned in Remark 1.24 on page 52, 
satisfies 
TY} ane YT 


—see Exercise 3.20 on page 107. In what follows, we will need to refine this concept a bit in 
order to be able to include indefinite binary quadratic forms. We let Pee denote the group 
of principal ideals (a) where Nr(a) > 0—see Definition 2.4 on page 65. Then we let 


Th. z 


+ — “DOr 
Pag 


known as the narrow ideal class group, or sometimes called the strict ideal class group. 
Also, 
+ iat 
hs. ‘= ICS. ? 
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is the narrow ideal class number. Clearly, when F is a complex quadratic field, then Co, = 
CS since norms are necessarily positive in this case. In the real case we will learn more 
as we progress. 


Note that in what follows, we use the symbol ~ to denote both equivalence in the ordinary 
ideal class group Cp, as well as equivalence of forms, but this will not lead to confusion 
when taken in context. 


We use the symbol & to denote strict equivalence in Cs 2 In other words, I ~ J in CL. 
when there exist a,3 € Op such that 


where Nr(aZ) > 0. The next result shows that this is tantamount to form equivalence. 


Theorem 3.5 — Form and Ideal Class Groups 


If Ca, denotes the set of classes of primitive forms of discriminantA -, where F is a 
quadratic field, then Ca, is a group with multiplication given by Dirichlet composition and 


+o ow 
Ch, = Cap. 


Proof. Let f = (a1,61,¢,) and g = (dg, be, c2), then by Exercises 3.2 and 3.9 on page 94, 
g ~ (a5,b5,c5) where gcd(a1, a5) = 1. Thus, Dirichlet composition is defined so we may 
assume the f and g to be united, without loss of generality. Let F’ = (a3, b3,c3) be given 
as in Definition 3.6 on page 98. Then we know that via the ideal correspondence given in 
Theorem 3.3 on page 97, 


(a1, (b1 — V/Ar)/2)(a2, (bs — VAr)/2) = (as, (63 — V'Ar)/2), (3.12) 


via the multiplication formulas on page 48. Thus, by Theorem 3.3 and (3.12), the Dirichlet 
composition of f(x,y) and g(x,y) corresponds to the product of the corresponding ideal 
classes, which shows that Dirichlet composition induces a well defined binary operation on 
Cap: 
Note that in what follows, if we have strict equivalence of ideals given by 

I= (a,(-b+ VAp)/2) © J = (a’, (-0 + VAr)/2), (3.13) 


then we may replace I by (aa’)I and J by (a?).J, so we may assume without loss of generality 
that a =a’. Via Theorem 3.3, we may define a mapping from CS. to Ca, as follows 


T:(a,(—b+ VAr)/2) ++ f = (a,b,c), 
where c = (b? — Ar)/(4a). Moreover, by the above, 
T(IJ) = r(I)r(J) 


since we have shown that ideal multiplication corresponds to form multiplication. To see 
that 7 is well defined, assume that a’ > 0 and b’ € Z in (3.13). Thus, since there are 
d,y € Or such that (6) = (y)J where Np(dy) > 0 then 


Nr(6/y) NW) = NU) =a, 
so Nr(d/y) = 1. By Exercise 3.21 on page 107, there is ao € Or such that 6/y = c/o’. If 


Me,Q(£) = uz? + vr +w 


102 3. Class Groups 


is the minimal polynomial of o over Q, then it is for o’ as well, so T(7) = T(o’) = (u,v, w). 


Hence, 
7((6/y)1) = T((e/0")) (2) = 7 (2). 


Hence, it suffices to prove that 7([) = 7(J) when I ~ J. By Exercise 1.59 on page 54, there 
exists 


X= ( poe ) € GL(2,Z), 
r Ss 


such that 
(-b+VAF)/2\ _ ee (—b' + /Ar)/2 
a ~ a ; 
Therefore, 
(= **) —b+ JAF 
. 2 a 2 
and 


(i) 
Tr Saas +sa=a, 


from which it follows that r = 0, s =p =1, and b= b’ — 2qa. Hence, 


ax? + bey + cy* = f(x,y) = g(a — ay, y) = a(x — qy)? +0 (x — qyy+ cy’, 


so f and g are properly equivalent, namely they are in the same class in C',,, so T is well 
defined. Now we establish the isomorphism. 


First we show that 7 is injective. Let 
r(a,(—b + VAp)/2) = f = (a,b,c) ~ r(a’, (-8 + VAp)/2) =9 = (@,8',c) 


in Ca,. Since 
(aa’)(a, (—b + Ar) /2) = (a”)(a’, (0 + VAr)/2) 


as O p-ideals, then we may assume that a = a’ without loss of generality since, if they are 
not equal, we may change the preimage to make it so as above. Now since 


(a) =o= 7 (4"* 1), 
2a 2a 


then 


—b+ J/Ar _ b+ 7Ar on =p+/Ap bake 

2a = 2a 2a 7 2a : 
given that these are the only two roots of f(a, 1) = ax? + bx + c = 0. However, the latter 
is impossible by comparing coefficients so the former holds, from which we get that b = b’ 
soc=c’. Thus, T is injective. 


either 


Lastly, we show that 7 is surjective. Let 
f(x,y) = ax? + bry + cy’ 
be a primitive form of discriminantA , and let 
a = (—b + VApr)/(2a). 
Then f(a, 1) = 0, and aa € Op. Define an DO -ideal as follows. Set 


3.2. Forms and Ideals 103 


_) (a,aq) ifa>0, 
~ (VAr)(a,aa) ifa<OandA p>0. (3.14) 


Therefore, 7(I) = (a,b,c) in the first instance is clear. In the second instance, we note that 


I = (a, (—b + /Ar)/2) so 
r(I) = 7((a, (—b + VAr)/2)) = (a,b, 0). 


Hence, 7 is surjective and the isomorphism is established. 


Corollary 3.1 The identity element of Ca, is the class containing the principal form 
(1,0,-Apr/4) or (1,1, (1 — Ar)/4) forA p = 0,1 (mod 4), respectively. 


Proof. Since 
gel VAr/2) = (1,0, —Ar/4) or rl, (i 5 VAr)/2) = cL 1, (1 = Ar)/4) 


depending on congruence modulo 4 ofA -, and the preimages are the identity elements in 
the principal class of CE. then the images are clearly the identity elements in the principal 
class of Ca. 


Remark 3.8 When F is a complex quadratic field, as noted in Remark 3.7 on page 100, 
Co, = Ch. 

so by Theorem 3.5 on page 101, 
Ca, = Co,. 


However, in the real case, this is not always true. For instance, by Exercise 3.14 on page 106, 
in the case whereA p = 12, Ca, 4 {1} and Co, has order 1. Yet by Theorem 3.5, 


+ ~~ 
Ch, Cap. 


Indeed, the case where the field F’ is real and has a unit of norm —1 or F is complex, then 
by Exercise 3.17 on page 107, Co, = CS. always holds. When F is real and has no such 
unit, for instance as in theA - = 12 case, then by Exercise 3.16, 


ICE. :Co,| = 2. 


Note as well, by Theorem 3.5, 

Bes =ha,, 
the number of classes of forms of discriminantA -. Also by the above discussion, we have 
demonstrated the following. 


Theorem 3.6 — Class Numbers of Forms and Ideals 

IfA pf is the discriminant of a quadratic field F’, then the class number of the form class 
group ha,, as well as that of both the wide ideal class group ho, and the narrow ideal 
class Dies is related by the following. 
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hoy ifA p <0, 
hoy ifA p > 0 and there exists a u € Up 
hap = hg. = with Np(u) = —1, 
2hp, ifA p >0 and there is no u € Up 
with Ne(u) =—-1. 


We conclude this section with a verification that ha, is finite. To do this we first need the 
following result. 


Lemma 3.2 — A Form of Reduction 
IfA p is the discriminant of a quadratic field F’, then in each class of Ca, there is a form 
f = (a,6,c) such that 
|b] < Jal < |e]. 
Proof. Let the form f = (a1,61,c1) be in an arbitrary class of Ca,. We may select an 


integer a such that |a| is the least value from the set of nonzero integers represented by 
forms in the class of f. Then there exist p,r € Z such that 


a= ap? + bipr+ cr’. (3.15) 


If g = gcd(p,r), then a/g? is represented by f, contradicting the minimality of |a| unless 
g =1. Therefore, by the Euclidean algorithm, there exist integers q, s such that ps—qr = 1. 
Also, 


f(px + gy, ra + sy) = a1(pa + qy)? + bi (pa + qy)(ra + sy) + c1(ra + sy)? = 
(ap? + bipr + cyr?)a? + (2pqay + (ps + gr)by + 2rsc,)xy + (arq? + bigs + c187)y? = 
ax? + Bry + Cy’, 
where the coefficient for 2? comes from (3.15), 
B = (2pqa, + (ps + qr)bi + 2rscy), 


and 
C =aiq" + bigs + cs". 


Set g(x,y) = ax? + Bry + Cy? and we have f ~ g in Ca,. We may select an integer m 
such that 


|\2am + B| < |al. (3.16) 

Thus, 
g(x + my, y) = a(a + my)? + B(x + my)y + Cy? = 
ax” + (2am + B)ary + (am? + Bm + C)y? = 

ax? + bry + cy”, 

with 
b=2am+ B, 
and 
c=am? + Bm+C. 

Set 


h(a, y) = ax? + bay + cy”. 
Then, sinceA - = b? — 4ac, given that f ~ g ~ h, then c = 0 implies thatA p = b?, a 
contradiction to the fact thatA - is a field discriminant. Hence, since h(0,1) = c, then 
\c| > |a| by the minimality of |a|. Thus, from (3.16), we have the result. 
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Corollary 3.2 Any form of discriminantA ; is equivalent to a reduced form of the same 
discriminant. 


Proof. By Theorem 3.1 on page 90, we need only prove the result forA p > 0. 
Claim 3.3 We may assume that (a, b,c) satisfies |a| < |c] with 
/Ap— lal <b<4/Ap. 
By Lemma 3.2, we may select a form (a,b,c) such that |b| < |a| < |e]. If “Ar — 2\a| > 6, 


then by setting 
VA 
m= ae Bel at | , 


2c 2c 
where 
_jJi1 ife<0, 
>) 0 Gfe> 0 
we get 


V Ar — 2I\c| < —b+ 2em < VApr. 


We now show that 
(a,b,c) ~ (c, —b + 2cem,a — bm + cm’). (3.17) 


Via the map 7 in Theorem 3.5, 


( —b+ /Ar 
Te a, —2.—_ 


5 ) + (a,b,c), 


and 


( b—2em+ /Apr 
T:(¢, ; 


) + (c,—b + 2cm,a — bm + cm’), 
as O p-ideals. However, by Exercise 1.60 on page 54 


(« ae 7 (c tyes) 
eo je ee ea 


sO 


= i — 9 
Cecae a Gan 


Since 7 is a bijection, we have established (3.17). 
If |a — bm + cm?| < |c|, then we repeat the (finite) process, this time on 


(c, —b + 2em, a — bm + em?), 


which must terminate in 


(A, B,C) ~ (a,b,c) 


|A| <|C| and /Ap — 2|A| < B< Ap. 


with 


This is Claim 3.3. 
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Therefore, 


0< VAr—b< 2la| < cl = [Ar Fl | VB +3). 


~ Fla) 


Hence, b > 0, so b? < Ar and |2a|? < 4|ac| = Ap — b? < Ap, 80 2|a| < /Ar < VAr +6, 
from which it follows that (a, b,c) is reduced. 


Theorem 3.7 — ha, <c 
If F is a quadratic field with discriminantA 7, then ha,, is finite. 


Proof. Note that by Exercise 3.11 on page 95, we need only consider the case whereA pf > 0. 
By Lemma 3.2 on page 104, for any class of Ca,, there is a form f = (a,b,c) in the class 
with 

lac] > b? = Ap + 4ac > 4ac, 


so ac <0. Moreover, 4a? < 4jac| = —4ac = Ap — b? < Ap. Therefore, 


lal < VAr/2, (3.18) 


so by Lemma 3.2, 
|b] < ~Apr/2. (3.19) 
Hence, by the bounds in (3.18)—(3.19), there can only be finitely many choices for the values 


a and b for a given discriminantA -. Since c = (b? — A)/(4a), we have established the 
result. 


Corollary 3.3 — Positive Definite Forms and Reduction 


WhenA fF < 0, then the number of inequivalent positive definite forms with discriminant 
AF is the same as the number of reduced forms. 


Proof. See Exercise 3.11. 


Corollary 3.4 — hp, < oo 
IfA - is the discriminant of a quadratic field F’, then ho, is finite. 


Proof. This follows from Theorem 3.6 on page 103 and Theorem 3.7. 


Exercises 


3.14. Prove that whenA - = 12 where F = Q(¥V3), then the form f = (—1,0,3) is not 
properly equivalent to the form g = (1,0,—3). This shows that Ca, 4 {1}. Show, 
however, that Co, = {1}. 


(Hint: See Corollary 1.1 on page 13 and Theorem 1.18 on page 29.) 
In Exercises 3.15-8.17, assume that Ap is the discriminant of a quadratic field F’. 


3.15. Let F be a real quadratic field and set 


= (1, 0, -Ar/4) ifA Fr = 0(mod 4), 
= (1,1,(1-Apr)/4) if p = 1(mod 4). 


Prove that a ~ —a in Cag, if and only if Op has a unit u such that Nr(u) = —1. 


3.2. 


3.16. 


3.19. 


3.20. 


3.21. 
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Let F’ be a real quadratic field. Assume that DO, does not have a unit of norm —1. 
Prove that ICS. :Co,| = 2. 
(Hint: Use Exercise 3.15.) 


. Prove that CS. = Co, if F is either a complex quadratic field or F' is a real quadratic 


field such that Or has a unit u with Ne(u) = —-1. 
(Hint: Use Exercise 3.15.) 


. Let F be a number field and let hg, be the (wide) class number of F’. Prove that if 


I is an integral D p-ideal, then I’9r ~ 1. 

(Hint: By Theorem 3.7, |ho,| <0co.) 

Prove the assertion made in Remark 3.6 on page 99 that (ac, b,1) ~ (1,0, Ar /4) when 
Ar = 0(mod 4) and (ac, b, 1) ~ (1,1,(1—- Ar)/4) whenA = 1(mod 4). 

(Hint: When Apr = 0(mod 4), in Definition 3.1 on page 88, select p = b/2, q = 1, 
r=-—1, ands =0, and when Ap = 1(mod 4) select p= —(14+ b)/2,q=-l,r=1 
and s = 0.) 

Prove that I’ = 17! in Co,. 

(Hint: Use The Multiplication formulas on page 48.) 

Let u be a unit in Or such that Ne(u) = 1. Prove that there exists an a € Or such 
that a = ua’, where a’ is the algebraic conjugate of a. 


(This exercise represents the quadratic analogue of Hilbert’s Theorem 90—see Biog- 
raphy 3.4 on page 94.) 


In 83.3, we will be looking at the work of Minkowski in the geometry of numbers, which 
opens the door to establishing Dirichlet’s celebrated unit theorem. 


Biography 3.6 Hermann Minkowski (1864-1909) was born on June 22, 1864 
in Alexotas of what was then the Russian empire, but is now Kaunas, Lithuania. 
He studied at the Universities of Berlin, then Konigsberg where he received his 
doctorate in 1885. Yet, even before this, in 1883, both he and Henry Smith 
were jointly awarded the Grand Prize by the Academy of Sciences (Paris) for 
the solution of the problem of representations of an integer as a sum of five 
squares. Eisenstein knew of a formula for such representations in 1847, but 
never provided a proof. 


Minkowski taught at Bonn, Konigsberg, and Zitirich, but in 1902, Hilbert cre- 
ated a chair for him at Gottingen where Minkowski stayed for the rest of his 
life. He pioneered the area we now call the geometry of numbers. This led to 
work on convex bodies and to packing problems. Furthermore, his geometric 
insights paved the way for modern functional analysis. At age 44, he died 
from a ruptured appendix on January 12, 1909 in Gottingen. Posthumously, in 
1910, his most original work, begun in 1890, was first published as Geometrie 
der Zahlen. 


Minkowski’s main interests were in pure mathematics, especially continued 
fractions and quadratic forms. However, he is also known for having laid some 
groundwork for Einstein’s relativity theory by thinking of space and time as 
linked together in a four-dimensional space-time continuum, from which he 
determined how to treat electrodynamics from a four-dimensional perspective. 
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3.3. Geometry of Numbers and the Ideal Class Group 


The human heart likes a little disorder in its geometry. 
from chapter 26 of Captain Corelli’s Mandolin (1994) 
Louis de Berniéres (1954—) 
British novelist and short-story writer 


In this section, we introduce Minkowski’s geometry of numbers in order to prove Dirichlet’s 
celebrated unit theorem, which we use to establish the finiteness of the ideal class number. 
In 83.2 we used the notion of forms to deduce this finiteness in the quadratic case—see 
Biographies 3.6 on the previous page and 3.9 on page 121. The reader must be familiar 
with vector spaces and related notions in Appendix A. 


Definition 3.8 — Lattices and Parallelotopes 
Let 1, £2,...,4m € R”, m,n € N,m <n be R-linearly independent vectors. If 


L={€ER":L=)_ 2,6; for some 2; € Z} = Z[Ky,..., Lm]; 


j=1 


then LD is called a lattice of dimension m in R”. When m =n, L is called a full lattice. In 
other words, a full lattice LD is a free abelian group of rank n having a Z-basis that is also 
an R-basis for R”. Furthermore, the set 


P= Sorc eR 0 Sy <a forg = ce en 


j=l 


is called the fundamental parallelotope, or fundamental parallelepiped, or fundamental do- 
main of L. An invariant—see Remark 3.9 below—of P is 


V(P) = | det(¢5)I, 


called the volume of P, and also called the discriminant of L, denoted by D(L). 


Remark 3.9 In Definition 3.8, the term invariant, when applied to P means that, irre- 
spective of which basis we choose for L, the volume of P remains the same. It is an easy 
exercise for the reader to verify that the determinant remains the same under change of 
basis using Exercise 2.34 on page 81. For the reader with a knowledge of measure theory, 
or Lebesgue measure in R”, the volume of a so-called measurable set S C R” is called the 
measure of S. This measure can be shown to be the absolute value of the determinant of the 
matrix with rows @; for 7 = 1,2,...,n for any basis {¢;} of S. Thus, the Lebesgue measure 
of S is called the volume of S. 


Example 3.2 Z” is a full lattice in R” for any n € N. In other words, a free abelian group 
of rank n in R” is a full lattice. Hence, 9p is a full lattice in R", where |F' : Q| =n. Also, 
note that any lattice of dimension m € N is full in R™. 
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We will now show that lattices as subsets of R” are characterized by the following prop- 


erty. First, we remind the reader that if s = (s1,82,...,5n) € R”, then |s| <r means that 
1/2 
a1 87 S77, since |s| = Os 3) , so |8;| <r for each such j. 


Definition 3.9 — Discrete Sets 
Suppose that S C R", n €N, r € R™, the positive reals, and 


8, = {s € R”: |s| <r} 


is the sphere or ball in R”, with radius r, centered at the origin. Then S is called discrete 
if 

ISM §,| < oo, 
for allr € Rt. 


Theorem 3.8 — Lattices Are Discrete 


Let D CR", LAS. Then L is a lattice if and only if L is a discrete, additive subgroup of 
R”. 


Proof. Let L be a lattice of dimension n, namely a full lattice in R”. If 
L=Z0@---@lyZ, 


{@,...,€n} is an R-basis for R”. Thus, any a € R” can be written in the form 
a= SG (rj E R). 
j=l 


Ifa € LN8, for any r € R*, then each r; € Z and |r;| <r for each j = 1,2,...,n. Hence, 
there exist only finitely many points in D1 §,.. In other words, L is discrete. 


Conversely, assume that L is a discrete, additive subgroup of R”. We use induction on n. 
For n = 1, let {€} be a basis for R, namely R' = RZ. Since $, 1 L is finite for all r € Rt, 
there exists a smallest positive value r; such that r,@€ L. Therefore, Zr ,@ C L. Since any 


s € R may be written as s = =| Tr, + 5171, for some real number s, with 0 < s; < 1, 
s 


then any sf € L may be written in the form sé = nr ,é+4+ s1r,@, with n = Fal € Z, and 


0 < s; <1. Therefore, by the minimality of r;, we must have that s; = 0, so L = Z[r,@]. 
This establishes the induction step. Assume the induction hypothesis, namely that any 
discrete subgroup of R* for k <n is a lattice, so we may assume that L C R” is discrete 
and L ¢ R* for any k < n. Hence, we may choose a basis {¢1,..., én} of R" with ¢; € L 
for each 7 = 1,2,...,n. Set 

V=R[4&,..-,€n—1]- 


By the induction hypothesis, 
Ly =LNV 


is a lattice of dimension n—1. Let {1,..., 8n—1} be a basis for Ly. Therefore, any element 
y € L may be written as 


n-1 


y= yo 738; +tnén (7; ER). 


j=1 
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By the discreteness of L, there exist only finitely many such y with all r; bounded. Thus, 
we may choose one with r, > 0, and minimal with respect to |r;| <1 for all 7 An. Let 6, 
denote this choice. Thus, 

R” = R[G1,---, Bn]. 


Then for any 6 € L, 


j=l 
Let mn a 
oa =6—) |t)|8; = >_ 8585. 
j=l j=l 
Therefore, 0 < s; < 1 for all j =1,...,n. By the minimality of r,, we must have s,, = 0. 


Hence, o € Ly, so 6 € Ly @ ZG,. This gives us in total that 
LCLly ®ZB, CL. 


Therefore, L = Ly © ZB, is a lattice. 
We also need other fundamental notions from geometry. 


Remark 3.10 In what follows, we use the fact that the volume of every bounded convex 
set exists, called Blanschke’s theorem. 


Definition 3.10 — Bounded, Convex, and Symmetric Sets 
A set S in R” is said to be convex if, whenever s,t € S, the point 
As+(1-A)te S$ 


for all A € R such that 0 < A < 1. In other words, S is convex if it satisfies the property 
that, for all s,t € S, the line segment joining s and ¢ is also in S. The volume of a convex 
set S is given by the multiple integral 


S 
carried out over the set S. 


A set S in R” is said to be bounded if there exists a sufficiently large r € R such that |s| <r 
for all s € S. Another way of looking at this geometrically is that S is bounded if it can fit 
into a sphere with center at the origin of R” and radius r. 


A set Sin R” is symmetric, sometimes called centrally symmetric, provided that, for each 
s€S,we have -seéS. 


Remark 3.11 By Remark 3.10, the integral in Definition 3.10 always exists for convex 
sets. 


Example 3.3 Clearly, ellipses and squares are convex in R?, but a crescent shape, for 
instance, is not. Also, an n-dimensional cube 

S ={s = (s1,...,8n) € R”:-1<s; <1 for 7 =1,2,...,n} 
is a bounded, symmetric convex set, as is an n-dimensional unit sphere 


{s © R”:|s| < 1}. 
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Before proceeding to the main result, we need a technical lemma. 


Lemma 3.3 — Translates and Volume 


Let S C R” be a bounded set and let L be an n-dimensional lattice. If the translates of S 
by L, given by 
S,={s+z:s8€S}, 


for a given z € L, are pairwise disjoint, namely 
S,10 Sy = ©, 
for each y, z € L with y ¥ z, then 
V(S) < V(P) 
where P is a fundamental parallelotope of L. 
Proof. Since P is a fundamental parallelotope of L, we have the following description of S 


as a disjoint union: 
S=Uzer(SNP_.), 


where P_, = {x —z: x € P}, so it follows that 


VS)=>S VisnPs). 


zeL 


Since the translate of the set SMP_, by the vector z is S$, P, then 
V(SOP_,) =V(S,NP). (3.20) 


Therefore, 
V(s) = >_V(S, NP). 
zeL 
If the translates S, are pairwise disjoint, then so are S$, P. Since S,1P C P, then 
Equation (3.20) tells us that 
S7V(SzNP) < V(P), 
zeL 


so the result is proved. 


Now we are in a position to state the central result of this section. 


Theorem 3.9 — Minkowski’s Convex Body Theorem 


Suppose that L is a lattice of dimension n, and let V(P) be the volume of a fundamental 
parallelotope P of L. If S is a symmetric, convex set in R” with volume V(S) such that 


V(S) > 2"V(P), 
there exists an x € SM L such that x ¥ 0. 


Proof. It suffices to prove the result for a bounded set S. To see this, we observe that when 

S is unbounded, we may restrict attention to the intersection of S with an n-dimensional 

sphere, centered at the origin, having a sufficiently large radius. Let 
Pese=te/d6-e SO). 


mo) 
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Then 


If the translates T, = 35 + z were pairwise disjoint, then by Lemma 3.3 on the preceding 
page, V(P) > V(Z), a contradiction. Therefore, there must exist two distinct elements 
s,t € L such that 


($5 —s)N(ZS—t) #@. 


Let z,y € S' such that $x —s= sy —t. Thent-—s= sy - $a. Since S' is symmetric, then 
—a € S, and since S' is convex, then sy + 3(—2) € S. Hence, t—sE SOL, andt—s40, 
as required. 


Remark 3.12 Some background to the language used above is in order. The term convex 
body refers to a nonempty, convex bounded and closed subset S of R”. The topological 
term “closed” means that every accumulation point of a sequence of elements in S must 
also be in S. This is tantamount to saying that S is closed in the topological space R”, 
with its natural topology. However, we do not need to concern ourselves here with this, 
since it is possible to state and prove the result without such topological considerations. It 
can also be shown that if S is “compact,” namely every “cover” (a union of sets containing 
S') contains a finite cover, then it suffices to assume that V(S) > 2”V(P). 


In order to prove the next result, we need a geometric interpretation of algebraic numbers 
in a canonical way. This is based upon the signature of a field given in Exercise 2.11 on 
page 63. 


Definition 3.11 — Canonical Embedding of Number Fields 

Let {r1,7r2} be the signature of a number field F. Suppose that 6;(£) C R for j = 1,...,171 
are the real embeddings of F' in C, and 6;(F) Z R for 7 =7, +1,...,71 +72 are half of the 
complex embeddings of F' in C, chosen such that exactly one 6; is taken from each complex 


conjugate pair 0;,0; of such embeddings. Then for each a € F’, define 
Or: Ft>R™xC”? 


by 
Or(a) = (01 (a), eyeena Or, (a), Or, 41(@), ooh Ory tre (a)). 


Remark 3.13 With reference to Definition 3.11,0 - is a Q-algebra monomorphism by 
Exercise 3.29 on page 121. Moreover, we may say more aboutO Ff as follows. 

R™ x C™ may be identified with R", where n = 1; + 2rg =|F': Q|, since each complex 
component 6;(a@) for 7 =7r1 +1,...,71 +12 may be replaced by a pair of components 
R(O;(a)), $(0;(a)) where R(x) and S(a) are the real and complex coefficients of 


z= R(t) + S3(x)V—-1 EC. 


Hence,O pf may also be considered as an injection into the real vector space R”. We will 
have significantly more to say about this later. 


We now provide an application of Minkowski’s Convex Body Theorem to the relationship 
between discriminants and norms of algebraic integers, which will prove to be highly valuable 
later in the text as well. 
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Theorem 3.10 — Applications to Norms and Discriminants 


Let {71,172} be the signature of a number field F’, with |F' : Q| =n = 11 + 2re, and let M 
be a free abelian group (Z-module) of finite index in Oy, namely |Dr : M| =meN. Then 
there exists a nonzero a € M such that 


Proof. Let B € R™, and define a set: 


TL 


2 
Sp(ri, re) = (Q1,--+,Q@r,, B1,-++5 Bro) € R" ~oCe :S- Jagl+25— 185 < B 


j=l j=l 


Clearly Sg is bounded and symmetric. We now verify that it is convex. Let a,b € R with 
a>0,b<1anda+6=1. Suppose that 


(a1, ete 1» Ar, 1, tee Bras (M1, Pax 5 Vena Oly £4 -: Or) € Sp(r1,T2)- 
We now show that 


(aay + by1,.--,@0%, + byr,,4 4 + b01,...,46-, + bd,,) € Sp(ri, 12). 


We have 
TL T2 TL TL v2 T2 
S Jaa; + by; +255 |aB; +05;| < So ala;| + S— dly;|+25— a|8;|+25— 016;| < 
j=l j=l j=l j=l J=1 j=l 


r1 T2 TL r2 
al SY lasl+25°18|) +) So byl +25 16) | <aB+oB = (a+0)B=B, 
j=l j=l 


j=l j=l 
so Sip(r1, 172) is convex. 


Claim 3.4 B 
T\rT2 B” 
V(Sp(r1,72)) = 2" (5) 


nl - 


We use a double induction on ryand rg. For r; = 1 and rz = 0, we are looking at the length 
of the line segment [—B, B] in R, so in this case, 


V(S,(1,0)) = 2B =2" es B” 


2 


nl 

If r,; = 0 and rz = 1, we are (essentially) looking at the disc of radius B/2 in R? (since 
R? =C). Thus, in this case, 

a 2 B” 


V(Sp(0,1)) = 7B?/4 = 2" (< 


ni” 


This completes the induction step. The induction hypothesis that we assume is 


k Bn 
V(Sp(m,k)) = 2™ (5) WT for any m <r, and any k < ro. 
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First, we calculate V(S'p(r1 + 1,1r2)). In this case, Sg(r; + 1,12) is defined by the relations 


ry r2 
lo] + S> Ja;|+ 25° |6;| < B with aeR, (3.21) 
j=l j=l 


and |a| < B since B £0. Therefore, using the induction hypothesis, 


p 2" T\ "2 2 ryt2r 
V(Sa(ri +1,r2)) = ‘ite V(SB_ja\ (71,72) )da = (m4 2ra)! (5) [@ =: |ail) da = 


r r B r 7 B 
(ry + 2r2)! 2 0 (ry + 2re)! 2 0 


after a simple change of variables and this equals, 


grivl T\r2 Brnteret re T\r2 B” 
patsy et 
(ry + 2r2)! (5) ry + 2ro+1 2 n! 


To complete the claim, we now calculate Vg(S(ri,r2 + 1)). In this case, S(r1,r2 + 1) is 
given by 


T1 T2 
S/laj|+25—[8;| +216] < B, 
j=l j=l 


where 8 = «+ y/—1 € C. Thus, in a similar fashion to the above, using the induction 
hypothesis, we have 


Oet T2 
Sis ( B—2y/a2 + y2) 4?" dady, 
Va(S(r1,72 + 1)) ioe 2 Tassel x? + y?) axdy 


and after a change of variables we get that the latter equals 


9r1 Tre B/2 2Qr 
el [ @ — 2w) 142r2cdudw = 
(ry + 2r)! + 2ro)! 


a (ry a ae = 2) 477d, 
Letting 2w = z and using integration by parts, we deduce 
Brit2r242 
A(ry + 2ro + 1)(r1 + 2re + 2) 


B/2 
| (B= 2)" t?"2udw = 
0 


Hence, 
rotl Britere+2 


(ry + 2rg + 2)? 


Ve(S(r1,r2 +1)) = 2 (5) 


and Claim 3.4 is proved. 
Let € be arbitrarily chosen in R*, and define B > 0 by 


ae? 
B'(6)= B= (=) nimy/|Ar| +e. (3.22) 
7 
Then by Claim 3.4, 


re B™ €2"1 (m7 /2)"2 
V(Sp) =2" (2) 2 = arte yfagi + SEM" s o-emyBep2". (8.23) 


We have one more result to establish that will allow us to invoke Minkowski’s Convex Body 
Theorem via (3.23). 
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Claim 3.5 V(Op(M)) = 2-"my/Ap| 


SinceO pf injects Or into R” in a natural way—see Remark 3.13 on page 112—then9 p(Dr) 
is a full lattice in R”. If {a1,...,a,} is an integral basis for F’, then the Or(a;) are R- 
linearly independent vectors in R”. Let 6; for 7 =1,2,...,n be the embeddings of F' in C, 
and let 2; denote the vector, 


(91 (a), + + Ory (Oi); Rr +1 (4); FOr 41 (Qe))s «+ RO ry 40 (i), F(Ory +0 (Qi). 
Then with the ¢; as row vectors, 
V(Or(Op)) = det(li) = (2V—1)-™ det (8; (ai)) = 2-"*| det (8;(ai))| = 2-7 V[Arl, 
since for any y € C, 
R(y) = (y + 9)/2, and S(y) = (y— 9)/(2V-1). 


Now Claim 3.5 follows by induction on m. 
By Claim 3.5, there exists a nonzero a = a(e) in M such thatO -(a) € Sg. Thus, since 


ryt+re 


\Ne(a)| = TT 1¢s(| I] la), 


j=rnitl 


then by the Arithmetic-Geometric Mean Inequality given on page 339, 


n 


2 ritre Br 
|Nr(a PG Ne SY IG} <=, 


nr 
j=ritl 


where the last inequality is from Equation (3.21). Therefore, by (3.22), 
A\" nl! € 
ar} on n 


Note that if € is in the interval (0,1), there are only finitely many possibilities for a = a(e). 
Hence, there exists an ap € M such that Equation (3.24) holds for all positive «. Thus, 


4\"? n! 
INe(ag)l< (2) Smy/[Asl, 


as required. 


Theorem 3.10 will be applied below to the problem of proving the finiteness of the cardinality 
of the class group. Thus, we restate it as follows, in terms of ideals, which we may invoke 
directly for convenience. 


Corollary 3.5 Let F be a number field with |F : Q| =n =r, + 2ra, where {ri,r2} is the 
signature of fF’. Then for any integral 0 r-deal J, there exists a nonzero a € I such that 


INe(@)| < (=) EVGA. (3.25) 


For what ensues, the reader is reminded of Definition 3.7 and Remark 3.7 on page 100. 
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Theorem 3.11 — Finiteness of the Ideal Class Group 
If F is a number field, then ho, = |Co,| < c. 


Proof. Via Remark 1.13 on page 26 and Definition 3.7, every ideal class H of fractional 0 p- 
ideals H contains an integral D p-ideal J. Also, there exists an integral ideal J € I~? € Co, 
so IJ ~ 1. By Corollary 3.5, there exists a nonzero a € J such that 


etal s (4) SVIBRINGS). 


Since J | (a), we may set H = aJ~!, so H ~ I and via Corollary 2.8 on page 85, 
|Nr(a)| 4\"? nl! 
N(A) = < V|Ar|. 
Go N(J) 7 \a nr |Ar| 
By Exercise 2.53 on page 86, there are only finitely many integral ideals with a given norm, 


so there are only finitely many choices for J. Given that I = J = H, then there are only 
finitely many choices for the classes H, namely |Co,| < co. 


Immediately from the proof of Theorem 3.11, we have the following important fact. 


Corollary 3.6 If F is a number field whereA f is the discriminant of F and |F': Q| =n 
with signature {71,72}, then every ideal class in Cp, contains a nonzero integral ideal I 
such that 


ND) < (2\" m /[Brl. (3.26) 


The right-hand side of (3.26) is a distinguished quantity. 


Definition 3.12 — The Minkowski Bound 
If F is a number field, the quantity 


4\"? n! 
Mem (2) pve 


is called the Minkowski bound, where Ar is the discriminant of F’ and |F : Q| = n with 
signature {r1,r2}. 


Remark 3.14 Corollary 3.6 tells us that every ideal class in Co, has a nonzero integral 
ideal with norm less than Myr. We can say more. Since N(I) > 1 for any integral ideal, 


then by Corollary 3.6, 
2n 


te ue (3.27) 


4) (nl)? 
which is Minkowski’s lower discriminant bound. Moreover, if n > 1, namely for F 4 Q, 
|Ar| > 1. We can say more as follows. 


|Ar| ea 


Corollary 3.7 For any number field F' with |F' : Q| =n, 


lAr| > ei (FE) am 
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Proof. By Stirling’s Formula—see Equation (A.7) on page 339— 


n e7o/(12n)+n 


:) 
n! 2mm 


for some a € R is located in the interval (0,1). Using (3.27), and the fact that 


eo/(12n) 1/12 = 12/11 
Lellee 3 Ai) /11, 


we get 


an)" S)-O" (SEY 


(2 (EY am2 (BY (2) 


where the last inequality follows from the fact that 


2 


since 7/4 <1. 
Corollary 3.8 For a number field F' with discriminantA pf, 


lim min (ArT = oo. 
n=00 |F:Q|= 


Proof. Since (me?)/4 > 5, then ((me?)/4)" > n, so by Corollary 3.7 we have the result. 
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(3.28) 


This fact places us in a position to present the following classical result due to Hermite, 


who published the result in Crelle’s Journal in 1857. 


on January 14, 1901 in Paris, France. 


Biography 3.7 Charles Hermite (1822-1901) was born on December 24, 1822 
in Dieuze, Lorraine, France. He was educated at Ecole Polytechnique, where 
he later taught. He is perhaps best known for his proof, published in Comptes 
Rendus de l’Académie des Sciences in 1873, that e is a transcendental num- 
ber. Using similar ideas to those of Hermite, C.L.F. Lindemann (1852-1939) 
produced a proof appearing in a paper entitled “Uber die Zahl a,” published 
in Mathematische Annalen in 1882, that 7 is also transcendental. (Linde- 
mann is also known for having published two invalid “proofs” of FLT in 1901 
and 1907.) A number of other mathematical entities bear Hermite’s name: 
Hermitian matrices, Hermite polynomials, Hermite differential equations, and 
Hermite’s formula of interpolation. On the human side, Hermite was a friend 
and supporter of Georg Cantor, when the latter was suffering his many nervous 
breakdowns. Also, Poincaré was Hermite’s best known student. Hermite died 


Theorem 3.12 — Hermite’s Theorem on Discriminants 


There are only finitely many number fields having a given discriminant d € Z. 


118 3. Class Groups 


Proof. By Equation (3.28), for a given d € Z, there exists a dg € N such that if n > do, 


then 
11 me? \" = 


Therefore, if |F : Q| > do, then |Ar| > |d|. Hence, it suffices to prove that for an arbitrarily 
chosen but fixed n,d € N, there exist only finitely many number fields F' such that |Ar| < |d| 
and |F' : Q| =n. By Remark 3.14 on page 116, the result is true for |d| = 1. Assume that 
d > 1. Then the case r; = 1 and rg = 0 is impossible. If r; = 0 and rg = 1, then n = 2, 
so F = Q(VD) for some squarefree D < 0. By Definition 1.33 on page 46,A ~ = 4D or 
Ar = D, so there is at most one quadratic field withA » = —d. We may now assume that 
r=T,+1T2g> 1. The balance of the proof is devoted to proving the existence of a primitive 
element for F' that comes from a finite set. In other words, we now establish the existence 
of an 6 € F such that F = Q(6) with 6 in a fixed finite set depending only on d. To this 
end, we define the following sets, broken down into two cases. 


Case 3.4 r,; £0 
Define the set 5; in R” by 
{(a1, ay Ory, Bry tds Vr) fe sgors Ve) : lar| <v d+ 1, av; <1 


fort = 2, 3,060.05 ,f1, and 67 +45 <1 for j =r, +1,...,r}. 


We now show that 5; is convex. Let a,b € R with a > 0,6 < 1, anda+b=1. Suppose 
that 


(a4, ace + Ory Bry ts Yritds # Be Pees (1, one » Ory) Pry $1) Or 41) fates » Pr, Or) € 81. 
For j = 2,3,...,71, we have 


laa; + bd;| < ala;| + 6|d;|<a+b=1, 


laa, + 66;| < alay| + d|d4| < avd +14+bVd4+1=(at+b)Vd+1=Vd4+1, 
and for j =r, +1,...,7, 
a(B? +7) + b(p3 +04) <a+b=1. 
Hence, 8; is convex. 
Case 3.5 r; =0 
Define the set Sz in R” by 
{(B1,V15+++5Brs Ir) + [Bil < 1, |v] < Vd +1, 67 +7 <1 for j = 2,3,...,r}. 


By a similar argument to that given in Case 3.4, $2 is convex. 


By integrating over products of intervals and discs, we get 
V(81) = 2" n"2Vd +1, and V(S2) = 2n"2-'Vd +1. 


Thus, 
V(81) aM aPVd+1 


= >? >1 


271 4/|Ap| 271 4/|Ap| = 


3.3. Geometry of Numbers and the Ideal Class Group 119 


and 


V(82) _— 2n”?—1Vd+1 


= Se Sl, 
VlAz| VlAz| 
(since ro # 0 in Case 3.5, given that r; = 0). To see that this is sufficient to invoke 
Minkowski’s Convex Body Theorem, we note that, for 7 = 1,2, we need 


V(8;) > 2"V(O(DF)). 
However, from Claim 3.5 on page 115, V(O(O)) = 2772 ,/|Apr|, so 
V(81) = 2%? Vd +1 > 2747 \/|Ap| = 2"V(O(Op)), 


and 


V (82) = 2072-1 Vd +1 > 272 4/|Ap| = 2"V(O(Dp)). 


Hence, we have the existence of a nonzero 6; € O(Or)M$§;, for 7 = 1,2. Let 6 be one of 


them. Since 
k 


M5,9(2) = > 2x) € Za] 
j=0 

with |z;| < Cg, for 7 = 1,2,...,k € N, where Cg is a constant depending only on d, there 
can only be finitely many such 6. It remains to show that F’' = Q(6). In Case 3.4, 6; is the 
only conjugate of 6 lying outside the unit sphere, since |N(6)| < 1, otherwise, and that 
is impossible. Similarly, in Case 3.5, 8; + 1./—I1, and 6; — 71V—I, with 7, 4 0, are the 
only conjugates of 6 outside the unit sphere. We have shown that in Cases 3.4-3.5, there 
exist conjugates of 6 distinct from the other conjugates. In other words, 6 has n distinct 
conjugates. Hence, F = Q(d). 


Biography 3.8 Laszlo Rédei (1900-1980) was born on November 15, 1900 
near Budapest, Hungary. After graduating, he became a secondary-school 
teacher until he was appointed professor at the University of Szeged in 1940. 
He remained there until he moved to Budapest in 1967. He did classical work on 
4-invariants of class groups of quadratic fields, as well as explicit construction of 
Hilbert 2-class fields of quadratic fields, and Euclidean algorithms in quadratic 
fields. Later, his interests moved mainly into group theory, but he also dabbled 
in combinatorics and graph theory. He died on November 21, 1980. 


Exercises 


3.22. Show that Minkowski’s Convex Body Theorem cannot be strengthened in the sense 
that the factor 2” cannot be replaced by a smaller one. 


3.23. Let M be a lattice of dimension n containing the lattice LZ of dimension n, with 
|M : L| = d € N as Z-modules. Suppose that {a1,...,Qn} is a basis for M and 
{61,..-,8n} is a basis for L such that for i =1,...,n, 


n 


b= S- 24,505 (21,5 c Z). 


j=1 


Prove that |M : L| = | det(z;,;)|. 
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3.24. 


3.25. 


3.26. 


3.27. 
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Let G be a free abelian group of rank n, and let H be a subgroup of G. Prove that 
G/H is finite if and only if the rank of H is n. Conclude that a subgroup H of a 
lattice DL that has finite index in Z must also be a lattice. 


For j = 1,2,...,n EN, let 
Fy(@1,---,%n) = 71901 + re j22 +++ +7 jEn, with ri; € R for? =1,2,...,n, 


called a linear form, and let L be a lattice of dimension n with discriminant D(L). 
Prove that if c; € Rt for j = 1,2,...,n satisfies the condition 


C1C2°°' Cn > | det(r;,;)|D(L), 


where det(r;,;) 40, then there exists a nonzero point (x1, %2,...,%,) of L such that 
[Fi (x1, £2, sess a) ES C1 
and 
|Fj(@1,@2,...,2n)| < cj for j = 2,3,...,n. 


(Hint: Use Minkowski’s convex body theorem.) 
(The result in this exercise is known as Minkowski’s Linearformensatz or Theorem 


on linear forms.) 


Suppose that r € R. Prove that for any m € N, there exists a p/q¢ € Q with gcd(p, q) = 
1 such that 0 << q<m and 


(Hint: Use Exercise 3.25.) 


(The result in this exercise has implications for the theory of continued fractions and 
solutions of Pell’s equation in elementary number theory—see [53, Theorem 5.8, p. 
218].) 


Let k,n,m; € N for j =1,2,...,k <nand F)(P),...,Fk(P) € Z be functions defined 
for points P in the lattice Z”. Suppose that for each 7 = 1,2,...,k, 


F(x) = Fj(y) (mod m;) 


implies that 
F(x —y) =0 (mod m,). 


Also, suppose that S is a symmetric, convex set in R” such that 
k 
j=l 


Prove that there exists a nonzero point P€ SMZ” and 
F;(P) =0 (mod m;), 
for 7 =1,...,k. 


(Hint: Use Exercise 3.25.) 
(This result was proved by L. Rédei in 1950.) 


3.3. 


3.28. 


3.29. 
3.30. 


3.31. 


3.32. 
3.33. 
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Let p be a prime not dividing m € Z. Prove that there exist integers x; for 7 = 1,2 
such that |x;| < \/p, and 

x2 = max, (mod p). 
(Hint: Use Exercise 3.27.) 


(This result is Thue’s Theorem. See [53, Theorem 1.23, p. 44] for an elementary- 
number-theoretic proof, and see [53, Biography 1.12, p. 45] for data on Axel Thue.) 


Prove that © given in Definition 3.11 is a Q-algebra monomorphism. 


Let k,n,t,m; € N for 7 = 1,2,...,k < nm such that 1 aes m,; < t*. Prove that for 
each system of linear forms Fj(x1,...,@n), there exist y; € Z, not all zero, such that 
lyj| < th/” and 

Fj(21,.--,2n) =0 (mod m,), 
for 7 =1,...,k. 


Biography 3.9 Johan Peter Gustav Lejeune Dirichlet (1805-1859) was born 
on February 13, 1805 in Dtren, that is now in Germany but was then in 
the French Empire. He taught at the University of Breslau (now Wroclaw 
in Poland) in 1827. Then he taught at the University of Berlin from 1828 to 
1855. He was appointed to the Berlin Academy in 1831. In 1855, Dirichlet 
succeeded Gauss at Gottingen. However, in the summer of 1858, he suffered a 
heart attack while at a conference in Switzerland. He returned to Gottingen 
where his illness was compounded by his wife’s death from a stroke. He died 
there on May 5. 


Dirichlet made contributions to the proof of Fermat’s last theorem in 1825. 
In 1837, his result on primes in arithmetic progression was published—see 
(54, Theorem 7.7, p. 258] for a self-contained proof. In 1838, his work on 
the formula for the class number of quadratic forms appeared. In 1839, he 
began an investigation of equilibrium of systems and potential theory. This 
led him to what we now call Dirichlet’s problem on harmonic functions with 
given boundary conditions. In 1863, his work, Vorlesungen tiber Zahlentheorie, 
contained his celebrated work on ideals and units in algebraic number theory, 
which is a central topic of this section. 


Suppose that F' is a number field and J is an integral Op-ideal. Prove that there 
exists a number field K = F(a) with a € A such that aDxK = IDK. 


(Hint: Use Exercise 3.18 on page 107 and Theorem 1.17 on page 28.) 
With reference to Exercise 3.31, prove that Op(a)N F = I. 


With reference to Exercises 3.31-3.32, prove that the following holds. Let 7 € A and 
Ox the ring of integers of any number field K. If 


Ox(y) =Oxl, 


then y = ua for some unit u € A. (Exercises 3.81-8.83 show that there is always an 
extension ring of integers of Or in which any given ideal I becomes principal as an 
“extended ideal” Ox (y). See Corollary 5.21, and Remark 5.8 on page 240 for related 
notions.) 


. Let F' be a real quadratic field with Nr(e,,) = 1. Suppose that I is an O;-ideal 


with I? = (a) for some a € Or where Nr(a) < 0. Prove that I 1. 
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3.4 Units in Number Rings 


That low man seeks a little thing to do, 
Sees it and does it; 

This high man, with a great thing to pursue, 
Dies ere he knows it. 

That low man goes on adding one to one, 
His hundred’s soon hit: 

This high man, aiming at a million, 

Misses an unit. 


from 1.113 of A Grammarian’s Funeral (1855) 
Robert Browning (1812-1869) 

English poet 

husband of Elizabeth Barrett Browning 


In §3.5, we will establish the celebrated Dirichlet unit theorem. We set the stage in this 
section by establishing results on the finite component of the unit group, namely the group 
of roots of unity. Of fundamental importance is the ring of integers of a cyclotomic field. 
This will become even more transparent later when we establish the Kronecker-Weber The- 
orem. First however, we need the following crucial result on a compositum of fields due to 
Hilbert—see Biography 3.4 on page 94. The reader should therefore be familiar with the 
discussion surrounding Application A.1 on page 325. 


Theorem 3.13 — Compositum of Rings of Integers 
Suppose that F; are number fields with number rings Or, and discriminantsA pf, for j = 1,2 
with gcd(Ap,, Ap,) = 1, and 

K=F\F 


is the compositum of F, and Fy. Then 
662640 
(where Op, Op, consists of all sums Pah aj; forn € N, a; € Op,, and 6; € Op,.) 


Proof. Since Or, Op, is the smallest subring of AK containing both Op, and Op,, then 
OrOr, CO. Thus, it remains to show that Dx C Or, Or,. If 


is an integral basis for F;, then the set consisting of all nynz products Brae is a basis 
for K over Q by Exercise 3.36 on page 129. Therefore, 8 € Ox may be represented in the 


form 
ny Nog 


B= > ae 8 
i=1 j=1 
with q;,; € Q. It suffices to show that q,,; € Z for each such i, 7. By Exercise 2.6 on page 63, 
we may let 0, for k = 1,2,...,1 be the embeddings of K in C that fix Fy pointwise. Thus, 


for each such k, 
ni n2 


64(8) = 32> 4:58) 8. 


i=1 j=1 
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Set 
n2 
i= S- 9,580”, 
j=l 
for 1 = 1,2,...,n1. Therefore, we have the nz equations 
. (a) 
9. (B) = S © 20; (8; ); 
i=1 
for k = 1,2,...,n2. We use Cramer’s rule, Theorem A.21 on page 337 to solve for the x; 
as follows. & 
a; = %/ det(O;(8; ”)). 
Set 


1 
yi = det(9;(B)”)). 
Then yj, 2; € A, and y? = Ar,. Thus, 
xjAr, = Yi € A. 
Therefore, 
n2 
tir => a j4r BO €ANF, =O, 
j=l 
However, Bz is an integral basis for Fo, so qi;Ar, € Z for each i,j. In other words, 
Gig = Miz /Mi,j7, where mj,ni,7 € Z with nj | Ar,. A similar argument shows that 


Nig | Ap,. Hence, ni,j | gcd(Ap,, Ap,) = 1, so qi; € Z for all such i,7 and we have the 
result. 


Theorem 3.14 — The Ring of Integers of a Cyclotomic Field 
If F = Q(¢,) where n € N, then Or = Z[¢,]. 


Proof. We may assume that n > 3, since the result trivially holds for n = 1,2. 
Claim 3.6 Ar | ner), 


By Theorem 1.25 on page 40, ” — 1 = ©®,(a)g(x) for some g(x) € Z[a], so we may 
differentiate both sides to get 


na”! = ©, (2) g(x) + ®n(x) g(a)’. (3.29) 


For « = Gn, (3.29) yields 
nn = 8'(Gn)a(Gn)s 


so by taking norms of both sides, 


+n? = Np(ngh-') = Nr(®n(6n)) Ne (9G). 
By Definition 2.7 on page 77, Exercise 2.31 on page 69 and Theorems 2.6—2.7 on page 71, 


Ar | Nr(®,(Cn)); 


so we have claim 3.6. 


Now we establish the theorem for a prime power. Suppose that n = p® for a prime p. 
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Claim 3.7 If @ € Op, then 
$(p*) 


7 
B= >) Ro 
a 
where ay = (1 — pa)’, and z; € Z withA fF | 23. 


If B= ee qj; with q; € Q, then for any i € N with 1 <i < ¢(p*), form 


d 
9; (8) = » qj 9; (a4), 


where 0; is an embedding of F in C for 1 < j < ¢(p*). By Cramer’s rule, 
qj = 2;/ det(;(ai)), 


where z; € A is determined in Theorem A.21, and det?(6;(a;)) =A . Therefore, a= 


GAr € ANQ=Z, by Corollary 1.11 on page 37. Hence,A p | 23, which yields Claim 3.7. 
Claim 3.8 If n = p%, then Z[¢,«] = Or. 
Since ¢,4 = 1—(1—)-), then Z[1—¢,2] = Z[¢p2], so it suffices to show that Or = Z[1—¢po]. 


By Claim 3.6, |Apr| is a power of p. If 6 € Or but 6 ¢ Z[1 — Ge], then by Claim 3.7, we 
may assume that 


$(p*) 
B= S- ay, for some d with 1 <d < ¢(p"), 
j=d 


where p{ zq. By Exercise 3.35 on page 129, Np(1 — G-) = p. Thus, 


TI;(1- Ge) _ Npr(1 — Ge) _ p 5 
(1 = Gyo) 9) (1 = Gyn) 9) (1 = Gyn) 


since for each natural number j relatively prime to p, we have 


1- Ga 
EDrp. 
1— Gp 
Therefore, 
Pp 
EDrp, 
(1 —Cpe)4 


which implies that 


(ae Se itn are: > 
(1 ~ Cpa )4 (1 = Gor d= Cpa 


In turn, this implies that 


2d Bp 
= S- 2ZjQAj—d-1 © Or. 


1 — pa (1 — Cpa)? yay 
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Thus, by Exercise 2.17 on page 68, and Exercise 3.35, 
Nr(1- Ge) =p | Nr(za) = 24”, 


Xo) | Za, a contradiction, which establishes Claim 3.8. 
Let 


b 

"7 

n= [7 
ai 


be the canonical prime factorization of n. Then for Fj = QC, 3), we have gcd(Ar,, Ar, ) = 


1, for any 24k with 1 < €,k < b, by Claim 3.6. We need ‘one more result to finish the 
Break 


Claim 3.9 If Fy, = Q(¢px.) and Fe = QG,..) fork # é, then Op, Op, = OF, ry. 
By Theorem 3.13 on page 122, 

Z Cpe pt] = Z| pee pie] = AlCpeelZ[Cp2* | = Op, Op, = Or, F,- 
Hence, by induction using Claim 3.9, 


Op a Z[Cn]; 


as required. 


The following is a stronger result than Claim 3.6 on page 123 in the case of a prime power. 


Corollary 3.9 — Discriminants of Prime-Power Cyclotomic Fields 
If F = Q(G«) # Q where p is prime, thenA p = (—1)?")/2pP*"(@P-1)-0), 


Proof. By Exercise 1.54 on page 43, we have 


Therefore, by taking derivatives, 
pra?) = pt ta? 1b 50 (z) + (a? —1)®,0(z). 


Thus, 


-1 


eet 
pices = (Ge — 1) Ba (Gye). (3.30) 
We observe that since ess ~ se primitive p*-th root of unity, we may invoke Exercise 3.35 


on page 129 to get, Nr(@e —1) = (—1)%"*)pP**. Hence, by taking norms of both sides 
of Equation (3.30), we get 


a a a-—1 
pre") = (1/6) NeW (Gpe)). (3.31) 
However, by Exercise 2.39 on page 82, and Exercise 2.31 on page 69, 
Np(®\0 (Cp) = (— 1) PP*VOR)—-DIZA p= (—1)PR V2 AR, 


Thus, via Equation (3.31), we get 


Ap = (-1)907)/2990(R*)—P°™ = (1) 9(0*)/25p**(@@—-1)-1) 
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which is the result. 


We will provide the complete generalization of Corollary 3.9 to the determination of the 
discriminant of Q(¢,) for any n € N when we have the tools to do so in Theorem 5.14 on 
page 216. 

Before establishing the main result on roots of unity for this section, we need the following 
result due to Kronecker—see Biography 2.2 on page 79. We we will substantially generalize 
the following later when we have developed the tools to do so—see Corollary 5.4 on page 200. 


Theorem 3.15 — Division of Field Discriminants in Towers 
IfQCF CK is an extension of number fields thenA - | Ax. 


Proof. By Exercise 2.42 on page 82, any integral basis for K contains an integral basis for F’. 
Let {a1,...,Q@g,a41,---;@n} be an integral basis for K where the first d elements provide 
an integral basis for F. From Exercise 2.6 on page 63, we know that |F': Q| =d|n =|K : Q|. 
Also, from that exercise we may arrange the embeddings 6;, (1 < 7 < n) of K in C in the 
following manner. Let 0;(a;) = al), and set 0;(a1) = al) for j = 1,2,...,d. Also, 
ensure that, for each 1 = 1,2...,n, we have arranged that 0;(a;) = 6.(a;), whenever j =k 
(mod d). This yields the following. 


AK = det(9;(ai))? = 


2 
BOAO. Seinen BO a 
. a GO) sg) al) ...a@ 
i eC en) me Ce) ee (n) | > 
Oat’ %a41 a4 Og41 
BO g@A) is Sd. Savels ai”) 


and by subtracting the j’” column from the (kd + 7)” column for 7 = 1,2,...,d, and 
k =1,2,...,n/d—1, this equals, 


al)... a Gaisd i 0 
@) a) 
ak? +.val 0---0 ee 0 7 
CEE) le Ode, OR Ue, oat finan Veet in) _ (a | = Ar 
Od41 Odq1 Cd41 dg41 Oge1 — Ca41 
Q) (a) (dt) a) (n) (a) 


OR? On On et @ fe eee ww ew we An — An 


where y € Ox. However, y = Ax /Ar € Q, so by Corollary 1.11 on page 37, y € Z, as 
required. 


In Definition 1.3 on page 2, we first met the notion of a primitive root of unity. Now we look 
at the group generated by them. Henceforth, for a number field F’, we denote the subgroup 
of Llp, consisting of roots of unity by Rp. 


Theorem 3.16 — The Group of Roots of Unity 


If F is a number field, then every finite subgroup G of the multiplicative group of nonzero 
elements of F consists of roots of unity, and is cyclic. In particular, Rp is a finite cyclic 
group. Moreover, |R| is an even divisor of 2Ap. 
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Proof. Suppose that |G| =n. It follows from Theorem A.3 on page 321, that there exists 
an element a € G such that a has order n and 6” = 1 for each 6 € G. By Theorem A.18 
on page 334, x” — 1 has at most n roots in C, so G has order at most n. Since a has order 
nand a,a?,...,a" =1 are all distinct then G = (a), the cyclic group of order n generated 
by a. In particular, Rp is a finite cyclic group. 

Given that {—-1,1} C Rp, then 2 | n = |Rp|. If n = pi'ps?---pr* is the canonical prime 
factorization of n in Z and since QG,a7) C F, then by Corollary 3.9 on page 125, and 
Theorem 3.15, : 


a;-1 
p,;’  (aj(pj—-1)-1) 
P; | Ar. 


Therefore, 
ep witeed 
;7  (aj(pj—-1)-1) 
ie | Ar. 
j=l 


If p; > 2, then 
py) (a;(pj —1) — 1) > ay, 
and if p; = 2, then 
py! (a;(pj —1) -1) > aj 1, 


from which the result follows. 


Now we establish a result that will allow the determination of the group of roots of unity in 
terms of their absolute value. Recall that the absolute value of y = a+ b/-—1 € C is given 
by |y| = Va? + b? = \/97, where ¥ = a — b\/—1 is the complex conjugate of y. Sometimes 
|y| is called the modulus of y. 

Theorem 3.17 — Bounds on Absolute Values 


Suppose that F’ is a number field with embeddings 6; for 7 = 1,2,...,d=|F :Q|inC, and 
r €R with r > 0. Then there exist only finitely many a € Or such that |6;(a)| <r for all 


SD aes de 
Proof. Let 
M = max {ar GL (4) cae 
2 j 
and set 
d-1 
F=8 f(x) = at +S > 2529 € Z[x] : |z;| << M 
j=0 


Then |F| < co. Set 
8={aeéF: f(a) =0 for some f(x) € F}. 
Then |8| < oo, as well. If a € F with |0;(a)| <r for all j =1,2,...,d, then 
|s;(@1(a),..-,Aa(@))] <M, 


for all 7 = 1,2,...,d, where the s; are the elementary symmetric functions given in Defini- 
tion A.16 on page 333. Since a € Op, then s;(61(a),...,0a(a)) € Z by Corollaries 1.11 on 
page 37 and A.9 on page 334. Therefore, 


d 


[[@ - 4()) € 5, 


j=l 


which implies that a € S. The result follows. 


The following result is due to Kronecker. 
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Corollary 3.10 a € Rp if and only if |@;(@)| = 1 for all j =1,...,d. 


Proof. If a € Rp, then 0;(a) € Rp, since 6;(a@)” = 1 for some n € N. Thus, |6;(a)|" = 1, 
so |6;(@)| =1. 

Conversely, by Theorem 3.17, there exist only finitely many a € Or such that |6;(a)| = 1. 
Since a® € Op satisfies |a*| = 1 for all k € N, then it follows that a* = a‘ for some k < &. 
Thus, a’—* = 1, which implies that a € Rp, as required. 


We conclude this section with a determination of Rp for a prime cyclotomic field F’. 


Theorem 3.18 — Roots of Unity in Prime Cyclotomic Fields 
Let F = Q(¢,) for p > 2 prime. Then 


Re = (—1) x (%); 


as a multiplicative group, and every element u € Up, may be written as u = wep where 
we ROU, andk € Z. 


Proof. By Theorem 3.14 on page 123, Or = Z[¢,]. Clearly, (—1) x (¢p) C Rr. Ifthe inclusion 
is proper, there isa ¢, € Rr with n { 2p. In particular, it must contain either ¢, = C4, where 
q # pis prime or Cn = ¢,2. However, C4 ¢ Q(¢p), since otherwise Cy € {1,¢p,...,¢P7*}, 
which is not possible. Since the degree of Q(¢,2) over Q is p(p — 1), then the latter cannot 
hold either. Thus, Rr = (—1) x (¢,), as required. Moreover, since there are no more 
complex units in Up,, then the last statement of the theorem must hold. 


Example 3.4 Let F = Q(¢,) for a prime p > 2, and set 


so its complex conjugate is 
1-G? _ Gi-G) 
1-¢p* Gp (1 Gp) 


Both are units in Or by Exercise 3.37. Thus, 


=-(E)(HE)-e™ 


a 


ro 
= ¢, 7. 


so if 7 is odd, then 


Hence, 


1-@) (1-G’ 
= RN Lp,. 3.32 
U 3 (ES Op ( ) 
The distinguished units v in Equation (3.32) are called cyclotomic units, about which we 
will learn more later in the text. 
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Remark 3.15 A result due to Hilbert, which he proved in 1897, says that the numbers 


0-2") 0-6 M/[0-@) 0-6)" 


where r is a primitive root modulo p and k = 0,1,...,(p—3)/2, provide a system of in- 
dependent units in Mo, for F = Z[¢,|—see Biography 3.4 on page 94. As this chapter 
progresses, we will learn substantially more about the role of units. 


The above shows that even for the relatively simple fields considered, there is somewhat of 
a difficulty in describing the structure of the units. For the general case, we will need to 
introduce some geometry to tackle the problem. We do this in §3.5. 


Exercises 


3.35. Let p be a prime, and a € N. Prove that Nr(1— G«) = p, where Cpa is a primitive 
p*-th root of unity. 


3.36. Suppose that F; = Q(a,;) are number fields, with |F; : Q| =n; for 7 = 1,2. Prove 
that 
|K :Q| < nine, 


where 
K = FF. = Q(at, az). 


Also, show that if 
ged(|Fi : Q|, [Fo : Q|) =1, 


then 
|: Q| = nino. 


Is the converse true? 


3.37. Let p be a prime, n = p* for some a EN, and F = Q(¢,,). Suppose that 7 € N such 
that gcd(j,p) = 1. Prove that 


3.38. Let a € Dr be prime, where F' = Q(¢,,) for n € N. Suppose that 


Qn en) 
for some a,b € Z. Prove that ¢2 = ¢°. 


3.39. Let n > 2 be an integer, and set F’' = Q(G¢,). We know that ¢, € Up,. Prove that 
Ne(Gn) = 1. 
3.40. Let n € N with n > 1. Prove that 


I] @-@)=>. 


1<j<n-1 
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3.5 Dirichlet’s Unit Theorem 


Experience is the name every one gives to their mistakes. 
from act 3 of Lady Windermere’s Fan (1892) 
Oscar Wilde (1854-1900) 
Anglo-Irish dramatist and poet 


In this section, the primary goal is to establish Dirichlet’s Unit Theorem, which gives, in an 
abstract fashion, a complete description of the group of units Up, of Or for any number 
field F. 


First, we need a variant of Definition 3.11 on page 112. 


Definition 3.13 — Logarithmic Representations and Spaces 


Let F' be a number field with signature {r1,r2}, where |F': Q| =n =r; + 2re, and note 
that (R*)" x (C*)"™ is the multiplicative group in R™ x C™ consisting of those elements 
with all co-ordinates nonzero. Define the map 


W: (R*)™ x (C1)? 4 RU, 


by 
W(a41,.--; Qr,; Ur; 41,+++, Ara) = (i(ar),.. 5 by (Ar, ),- +5 bg (Ary )), 
where 
L(a;) = log. (|a;|) ifl<j<ry, 
Pete SV loge(lay |?) at trae eg Sores 
Let the map 


Lp: FWR™, 


be given via the composition of functions 
Lp =Wo Or, 
whereO - is given in Definition 3.11. Then for any a € F, 


Lp(a) = (log. (|81(a)]),--- 1oge(|9rs (@)|), loge (|r, +1(@)|*), - -- log. (Or ()|”)). 


Lp is called the logarithmic representation, or logarithmic map of F, and R™*"2 is called 
the logarithmic space. 


By Exercise 3.41 on page 136, the logarithmic representation £ of Definition 3.13 is a 
homomorphism of the multiplicative group F* of nonzero elements of F' to the additive 
group of the logarithmic space R"™*"2. In fact, this is the reason for introducing logarithms 
in the first place, namely to link this section with the preceding one in the sense that the 
group Up, is multiplicative, whereas Minkowski’s Convex Body Theorem applies to lattices, 
which are additive. Hence, we now have a method that maps from one scenario to the other 
via Lp. If we consider the restriction of Lr to Up,, we begin to get the picture. 
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Lemma 3.4 — The Kernel and Image of Lp 
If F is a number field with signature {71,72}, then 
ker(L r) = Re, 
and £r(Up,,) is a lattice in R™*", having dimension less than rj + ra. 


Proof. Since £r(a) = 0 if and only if |@;(@)| = 1 for all 7 = 1,...,1r2, then by Corollary 3.10 
on page 128, 0;(a) € Rp for all such 7. Hence, ker(Lp) = Rp. Let r = ry + rq for 
convenience. Then for a € MUp,,, since 


n rT rytre 
+1 = Np(a) = O;(a) = Il 9; (a) Il 9;(a)6;(a) = 
j=l j=l j=ritl 
TL Tihs 
A(a) J] |6;(a)/, 
j=l g=ritl 
then 7 
((@) = log. (|Nr(a@)|) = log(1) = 0, 
j=l 
sO 
TL rytre 
fats.) C{Gisa.st eR 2 a2 Sap = 0},77 
j=l j=ritl 


which has dimension r — 1. To prove that £(Up,,) is a lattice, we invoke Theorem 3.8 on 
page 109. By definition, it is an additive subgroup, so we need only prove that it is discrete. 


Let a € Up,. Then |£r(a)| <r. For n EN, set 
Sn = {a € Uo, : |0;(a)| <n for all 7 =1,2,...,r}, 
called a cube with side n centered at the origin. Since for each 7 = 1,...,7r, 


It; (4;(@))| < log. (Sr (@)|) <n, 


then |0;(a)| < e” for 1 < 7 < ri, and |6;(a)|? < e” for r1) +1 <j <r. Hence, 8, has 
only finitely many points. However, 0(8,,) is an injection of 8, into the r — 1-dimensional 
hyperplane. Thus, £7({po,,) is a lattice. 


The next step toward the unit theorem is to establish that £(U,,) is actually of dimension 
r —1 rather than just contained in a hyperplane of that dimension. 


Definition 3.14 — Norms of Elements in Logarithmic Space 


If F is a number field with signature {r1,r2}, and @ € R™ x C™ with ¢ = (¢1,...,lr499), 
then the norm of £ is given by 


3.2This set is an example of a hyperplane. In topological language, an osculating hyperplane of a convex 
set S C R” is a hyperplane that has a point of its boundary in common with S, but is disjoint from the 
interior of S. Recall that the boundary of a set S is defined to be the intersection of the closure of S with 
the closure of its complement, whereas the interior of S is the set of all points s € S for which there exists a 
disc with center s, contained in S. A fundamental result concerning osculating hyperplanes is the following. 
If S is a convex set in R”, and P is a point on its boundary, there exists at least one osculating hyperplane 
of S containing P. 
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The term norm in Definition 3.14 is appropriate and in keeping with the notion of norm 
given in Definition 2.4—see (3.33) on page 132. In preparation for the following, the reader 
is reminded of linear transformations and their matrices as given in Definition A.20 on 
page 338. 


Lemma 3.5 — Linear Transformations and Norms 


Suppose that F’ is a number field with signature {r1,r2}, and let 2 € R"™ x C™. Let the 
map 
Ag: R™ x C2 1 R”™ x C™, 


be defined by Ae(a) = @x. Then ¢ is a linear transformation and det(A,) = Nr(é). 


Proof. Choose the canonical basis for R™ x C™, namely {vj} for 7 =1,2...,.n=1r1 + 2re 
where v; = (0,...0,1,0...0), where the 1 is in the j*” place for 7 = 1,2,...,r1, and 
v; = (0,...0,1+/—1,0...0) with the 1+./—1 in the j place for j =r1+1,...,r1 +72. 
Thus, if 


£ = (1, ware eee) > (£1, as ee Cae + nr 4iv-1, res 5M py + Npy V —1), 


then the matrix of Az is given by the almost diagonal matrix, 


fy O QO: 0 

0 by QO: 0 

0 ao 0 

OF ae Mry41 Nery +1 0 ; 
OM cas Nr +1 Mrz 41 0 

0 Mry Niro 

0 Nps Mry 


whose determinant is given by 


TL ritre TL ryt+re 
H4 TL @itep= Tle I] P=. 
j=l j=ritl j=l j=ri4l 


as required. 


Now we are in a position to establish the dimension of £r(LUo,.). 


Theorem 3.19 — The Dimension of £F(LUp,) 


If F is a number field with signature {r1,r2}, then £r(Mo,) is a lattice of dimension 
rp tr2—-1in RR", 


Proof. By Exercise 3.41 on page 136 and Definition 2.4 on page 65, for each a € F, 
TL r2 
Nr(@(a)) = Ne(a) = [[ 4a Tl 0; ( = Thu O(a) [J 1@(a))?. (3.33) 
j=l g=ritl g=ritl 


Therefore, for any a € Dr, Or(a) © L£r(My,) if and only if |Nr(Or(a))| = 1. Thus, 
for any £ € R™ x C™ with |Nr(Or(a))| = 1, we must have | det(A:)| = 1, by Lemma 
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3.5. Hence, the latticesO p(O-r) and A~(Or(O-r)), with |det(\,)| = 1, have the same 
fundamental parallelotopes with the same volume, namely 


V(O(9F)) = (VAcd(OWFr)))) = 27-7 V|Arl, 
by Claim 3.5 on page 115. 
Let c; € Rt for 1 <j <7ritre, and set 


S = {(1,..-, lr tre) € Rx C” : |0j| <c; for 1 <j < ri; |6|? <c form <j <ritro}. 
By the same reasoning as that given in the proof of Theorem 3.12 in Case 3.5 on page 118, 


we deduce 
ritre 


8) = 277" II Cj. 
j=l 


Now the object is to use Minkowski’s Convex Body Theorem to get certain required points 
in A\c(07(O)). To be able to invoke Minkowski’s Theorem, we need 


V(8) SQM 25 A4/ |Ar| = = 2"V V(A\0e(Or(DFr))). (3.34) 


To achieve this, we can assume that the c; were chosen such that (3.34) holds, with 
| det(Ac)| = 1. Therefore, there exists a nonzero a € Or such that Ae(Or(a)) € 8. Then, 
for = (41,...,lry4r); 


Ae(Or(a)) = (O1(a)l1,» +, Orr tro (a)bri tre), 
with 
|0;(a)e;| <¢; for 1 <j <r, and |0;(a)e;|? < c; form <j <ritre. (3.35) 


Since | det(A¢)| = 1, then by (3.33) 


ry ry tre ritre 
INe(a)| = [Pie TT leer? < Tf a. 
j=l j=ri+1 j=l 


By Theorem 3.17 on page 127, there exist only finitely many a € Or such that for all k, 


ritre 


|x(@)| < II Cj. 


Let {f1,..., 8x} be the set formed by a. Then a must be an associate of one of the 8,’s since 
the norms are the same for a and one of the 6;’s. Let a = ui; for some t = 1,...,k, where 
ui € Uo,. Also, in view of (3. 35), |0; (a) e;| = |6; (ur) £56; (6z)| < cj, for See = ae. 1; 
and |6;(a)£;|? = |0;(u1)6;0;(G)/? < ¢, for j=ry +1,...1r1 +12. 

Let aj = minycrce{|0j(5)|}- Thus, |9;(ur)| [ej] <e/aj (1 <j <1), and [6;(an)]- 
ej < /qj/aj (m1 <j S11 +72). Now we place a further restriction upon ¢ (other than 
| det(Az)| = 1), namely we assume that for some B € R*, 


1 
> pee 
and |€;,;=B (2<j<rit+re2). Hence, 
Brtre-le Cc; 
|x (u1)| < =, 4;(m)1< (2 <3 <r), 


ay aj;B 
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and 
re 
aj(ur)| <M (ry <j <r +12). 
aj 
We may also assume that B is selected to be sufficiently large so that |@;(u1)| < 1 for all 
j #1. Therefore, [;(0;(ui1)) <0 for all j = 2,...,r1 +12. Also, |Np(ui)| = 1, so by (3.33) 
on page 132, 


ry+re 
Y= 5 (8;(u1)) = 0, (3.36) 
j=l 
so 
ri+r2 
4 (A1(u1)) =— S2 G(6;(u1)) > 0. 
j=2 
Continuing in the above fashion, we can manufacture units U2, ug,...,Ur;+ro—1 © Uo,, such 
that 
(;(8;(ui)) < Oia # j, (3.37) 
and 
rytre-l 
> &(0j(us)) > 0 for all i=1,...,71 +12 -1, (3.38) 
j=l 


where (3.38) follows from the fact that 


ritra 


> 45(6;(us)) = loge(Nir(@)|) = 0, and fy +ra(B5(us)) < 0. 


with the first equality stemming from (3.33). 
Now we introduce a map that reduces the dimension by one. This will put us within striking 
distance of the main result. Let P : R+7? + R™*"2~! be given by the projection, 
P(é1, eg pe iige re (41, eer Deer 
Claim 3.10 The vectors P(£Lr(u;)) for 1 <i <1; +r2—1 are R-linearly independent. 


Let M = (m,,;) € Mnxn be the matrix given by m;,; = P(Lr(u;)), and n= 171] +12 —1 for 
convenience. Hence, mij; <0 if i #7, and 


Sm, > 0 for alli =1,...,n. (3.39) 
j=l 
We will have the result if M/ is nonsingular. Assume that it is not. Then there exist r; € R, 
not all zero, such that 
So migri = 0 for alli =1,...,n. 
j=l 
Let no € N with no < n such that |r,,,| > |r;| for all 7 =1,...,n, and assume that r,,, > 0 
(since we may otherwise replace all r; by —r;). Thus, by (3.39), 


0= PnoMno no 1 5 Mng i775 > Tno™Mno no 1 5 Mno,j | Tro > 0, 
J#No J#No 


a contradiction that establishes Claim 3.10, and hence the entire result. 
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Theorem 3.20 — Dirichlet’s Unit Theorem 
Suppose that F is a number field with signature {r1,r2}, and let m = |Rp|. Then 


Up, =ZxZx---x ZX Gm) = (ur) X (ua) X +++ X (Ur tr2—1) X (Gm), 
r1+r2—1 copies 


where ¢,, is a primitive m” root of unity. Any such system of units uj for j =1,...,r1 + 
rg — 1 is called a fundamental system of units. 


Proof. By Theorem 3.19 on page 132, there exist units u; for j = 1,...,r1 +7r2—1 such 
that Sr(Uo,) has £r(u;) as a Z-basis. Thus, for any u € Upy,,, there exist unique z; € Z 
such that 


ritre-1 
Lp(u)= SY) 7h p(uj). 
j=l 
Therefore, 
rytre-1 
Lrlu II u i | =0. 
j=l 


To complete the proof, we need to show that if £r(v) = 0 for v € Up,, then v € Rp. 
However, this is Lemma 3.4 on page 131. 


Application 3.1 —Units in Real Quadratic Fields 

A simple application of Theorem 3.20 is to a real quadratic field. Since ry = 2, and rp = 0 
for F = Q(/Apr), Uo, & (ui) x (-1), namely there exists a smallest unit u; > 1 such 
that for any u € Ulp,, u = tuft for some a € Z. We denote ui by eq, and call this 
the fundamental unit of Q(,/Af). The uniqueness is given by Dirichlet’s Theorem 
and our insistence that the unit be bigger than 1 as a generator. SinceA pr > 0, then 


Re = (—1) = (Ga). 


Example 3.5 If F = Q(V/Ar) forA r < 0, then r; = 0, and rp = 1, so Lo, = Rr as 
given by Theorem 1.29 on page 47. 


Based upon fundamental systems of units, we now show that determinants of logarithmic 
representations do not vary. This will allow for the definition of another invariant of a 
number field F’. 


Theorem 3.21 — Determinants of Logarithmic Maps 


Suppose that F’ is a number field with signature {r1,r2}, and {u;}, {vi} for? = 1,2,...,rit+ 
rg — 1 are systems of fundamental units. Then |det(L(u;))| = |det(£-(v;))|, where 
(Lr(u;)) is a matrix with entries log,(|0;(u;)|), and (£r(v;)) is a matrix with entries 
log.(|@;(vi)|), where 6; are the embeddings of F in C. 


Proof. Set r =r, + rg —1, and assume that |R-| = m. By Dirichlet’s Unit Theorem, we 
may write, for each i = 1,...,7, 


VU4= Bs II uy (gdare € Z), 


and 


Uu;= Cf a (Gea, & €E Z). 
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By uniqueness of representation of units, (d;,;)~' = (aj,;), so det((a;,;)(d;,;)) = 1. Hence, 
| det(az,;)| = | det(d;,;)]. Since 


Tr 


O;(v) = 0; (Ch) TT 05 (ux) **, 


k=1 


then log. (|8;(vi)|) = log.(|0; (Cai) + Sopa1 i,k log. (10; (wa)]) = Lopar in log. (10; (ux)l), 
where the last equality follows from Corollary 3.10 on page 128. Hence, 


| det (log. (14; (vi)|))| = | det log. (18; (ws) 1))I, 


so | det(Lr(u,))| = | det(Lr(u;)|, which is what we sought. 


Based upon Theorem 3.21, we may now define an invariant of F’. 


Definition 3.15 — Regulators 


Let {u1,...,Ur,;4+r2—-1} be a fundamental system of units of a number field F’ having sig- 
nature {ri,r2}. Then the regulator of F' is given by 


tp = |det(L£p(u;))|. (3.40) 


Computation of the regulator, given in Equation (3.40), of a number field is difficult, since 
we must know in advance a fundamental system of units. However, for real quadratic fields, 
knowledge of the fundamental unit is sufficient and tables of such units exist (for instance, 
see [49, Appendix B, pp. 287-312]). 


Example 3.6 If F = Q(/5), then r; = 2, and r = 0. Since the fundamental unit is 


é5 = (1+ V5)/2, then 
(: rf 4) 
tr = log, 5 : 


Exercises 


3.41. Prove that the logarithmic representation £ of Definition 3.13 is a homomorphism 


of the the multiplicative group F* of nonzero elements of F' to the additive group of 
R172, 


3.42. Let F be a number field. Prove that ker(L-) = Rp. 


3.43. Let F be a real quadratic number field withA » = 5(mod 8), and fundamental unit 
Ea, = (T+UVAr)/2, where T,U € Z. Let G be the subgroup of Z[,/AFr] consisting 
of the positive units. Prove that G = (€,,,) if and only if T and U are both even. 


3.44. With reference to Exercise 3.43, prove that G = (e4.) if T and U are odd. 


(This is related to a problem of Eisenstein, namely the determination of criteria 
for the solvability of the Diophantine equation |x? — Apy?|=4 with gcd(x,y) =1 
for x,y€Z. There is an underlying interplay between the two rings Z[/Ar], and 
Z\(1+/Apr)/2] that helps to explain the phenomenon. Solution of the aforemen- 
tioned Diophantine equation, for Ar =5(mod 8), is equivalent to ca, ¢ Z[VArI. 
See [49, Exercises 2.1.14-2.1.16, pp. 59-61]. Also, see Example 1.32 on page 52.) 


Ye 
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Biography 3.10 Ferdinand Gotthold Max Eisenstein (1823-1852) was born 
on April 16, 1823 in Berlin, Germany. From an early age he suffered from 
ill health. While still young, he travelled with his parents to Wales and Ire- 
land where he met W.R. Hamilton, who acquainted Eisenstein with the work 
of Abel. This inspired Eisenstein to study mathematics further, and he en- 
rolled at the University of Berlin upon his return to Germany. Subsequently 
he produced many papers, twenty-five of which were published in Crelle’s Jour- 
nal where Abel had published his pioneering work. Among his achievements 
was the introduction of generalized Jacobi sums to obtain a proof of the law 
of biquadratic reciprocity. Gauss had such respect for him that he is pur- 
ported to have said that there were only three epoch-making mathematicians: 
Archimedes, Newton, and Eisenstein. However, due to his ill health, Eisenstein 
was not allowed to fulfill this assessment. Humboldt had collected money for 
Eisenstein to travel to Sicily to improve his health. However, he died before he 
could go there. His death occurred, at the age of twenty-nine, from pulmonary 
tuberculosis on October 11, 1852 


If F is a quadratic number field and I? ~ 1 in Co,, then I is called an ambiguous class 
in Cog. If I = (a,(b+ VAr)/2) is an integral Dp-ideal, then I’ = (a, (b — /Ar)/2) is 
the conjugate ideal of I, which we introduced for prime quadratic ideals in Remark 1.24 on 
page 52 and illustrated further in Example 2.15 on page 85. Thus, via Exercise 3.20 on 
page 107, an ambiguous class of Cp, is a class I in which I ~ I’. Indeed, for an ambiguous 
class, I= 1-!. If =I’, then I is called an ambiguous 0 p-ideal. For a prime p € Z, the 
maximum elementary abelian p-subgroup of Co, is denoted by Co,.», if |\Co;,p| =p", the 
r is called the p-rank of Cp, —see Definition A.3 on page 320. We let hp,,» denote the 
order of Cog p- 

Exercises 8.45-3.54 below are devoted to studying these ideal classes, and in particular to 


establishing Gauss’ result on the 2 rank of Or—Exercises 3.48 and 3.54. Thus, in these 
exercises, we are assuming that F' is a quadratic number field with discriminant Ar. 


3.45. Suppose that eitherA - < 0 orA - > 0 and Ne(ea,) = —1. Prove that every class 
of Co,,2 has an ambiguous ideal in it. 


(Hint: Use Exercise 3.21 on page 107.) 


3.46. LetA p < 0 be the discriminant of a quadratic field F over Q, and let wa,, be defined 
as in Application 2.1 on page 77. Suppose that J = (a, btwa,,) is an integral D r-ideal 
with a > 1, b> 0, and Ne(b+wa,) < Nr(wa,)*. Prove that I ~ 1 if and only if 
a= Nr(btwa,). 


3.47. Suppose that J is an integral D y-ideal in a quadratic field F = Q(/Ar). Prove that 
N(1) | Ap if and only if 2 =I’. 


3.48. Suppose that eitherA - < 0 orA - > 0 and Ne(ea,) = —1 and thatA = is divisible 
by N distinct primes. Prove that hp, 2 = 271. 
(Hint: Use Exercises 3.45-3.46.) 

3.49. Assume thatA p > 0 and Nr(éq,) = 1. Then by Exercise 3.21, ea, = a/a’ for some 


a@ € Op. Prove that the only primitive ambiguous 9 ;-ideals are (a), Dr, Dr, and 
(aV/Dr), where Dr is the radicand of F defined in Application 2.1 on page 77. 


3.50. Suppose that a € I where I is an O-ideal with N(I) = |Np(a)|. Prove that I = (a). 
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3.51. If I is a primitive integral DO p-ideal, prove that (N(1)) = II’. 


3.52. Assume thatA - has only one prime divisor, namelyA » = g = 1(mod 4) is prime or 
Ar =8. Prove that Nr(éa,) = —l. 


(Hint: Use Exercises 3.21 on page 107 and 3.47.) 


3.53. Using Exercise 3.52, prove that for any prime p = 1(mod 4), there exist x,y € Z such 
that x? — py? = —-1. 


3.54. Assume thatA - > 0 and Nr(ea,) = 1. Prove that ho,.2 = 2%~?. 
(Hint: Use Exercises 3.49-3.52.) 


Let F be a quadratic number field. An Op-ideal I = [N(I),a] is called reduced if it 
is primitive and there does not exist an element y € I such that both |y| < N(L) and 
ly’| < N(L). Exercises 3.55-3.61 are in reference to reduction in quadratic number fields. 


3.55. Prove that ifA » > 0, then J is reduced if and only if there is an element 6 € I such 
that [= [N(J), 6], 6 > N(J), and —N(J) < pf’ <0. 
(Note that when Ar <0, then this means that there is no y € I such that |y| < N(J) 
where |y|? = yy! = Nr(y7). The notion of reduction comes from the theory of binary 
quadratic forms—see Definition 3.4 on page 90.) 


3.56. Prove that if N(Z) < \/|Ar|/2, then I is reduced. 


3.57. Prove that if I is reduced, then N(I) < /Ar, when F is real, and N(1) < \/|Ar|/3 
when F is complex. 


3.58. Let I be a primitive, ambiguous 0 -ideal, whereA - > 0. Prove that if N(I) < V/Ar, 
then either I is reduced, orA r = 0(mod 4), and J divides the ideal (,/Ar/4). 


3.59. Let I be a primitive, ambiguous 0 -ideal, whereA - > 0. Prove that there exists a 
reduced ambiguous ideal J such that J ~ I. 


3.60. Let J be a reduced ambiguous 9 -ideal, such that J 4 (1), andA - > 0. If 4 | Ar, 
then also assume that 


If (2,54 VAr/4) , where b = Ap/2 (mod 2). 
Prove that either N(I) or N(J)/2 is a nontrivial factor of the radicand D of F. 


(This exercise underlies the fact that the so-called Continued Fraction Algorithm can 
be used as a method for factoring—see [49].) 


3.61. Suppose that F is a real quadratic field. Let J be a primitive principal 9 -ideal, 
such that gcd(N(I), D) = 1, and N(I) = n? for some n € N. Prove that there is an 
Op-ideal J such that J = J?. 


Remark 3.16 Note that in Exercises 3.43-3.44, and 3.52-3.53, we are essentially dealing 
with the solutions of Pell’s equation x? — Dy? = +1. Euler misattributed a method of 
solving this equation to John Pell (1611-1685), whence its name. However, another English 
mathematician, William Brouncker (1601-1665) actually found the method. Lagrange was 
the first to prove that the positive Pell equation always has infinitely many solutions—see 
Biography 3.3 on page 93. The above exercises show that the Pell equation is actually 
about the fundamental unit of a quadratic field. Often, in an elementary number theory 
course, continued fractions are employed to solve the equation—see [53, §5.3, pp. 232-239] 
for instance. 


Chapter 4 


Applications: Equations and 
Sieves 


If we could find the answer to that [why it is that we and the universe exist], it would 
be the ultimate triumph of human reason—for then we would know the mind of God. 
from A Brief History of Time (1988). 

Stephen Hawking (1942-) 

English theoretical physicist 


This chapter is devoted to looking at how we may apply the first three chapters to the 
solutions of Diophantine equations and to factoring via the number field sieve and Pollard’s 
sieve. 


4.1 Prime Power Representation 


We have looked at representation problems, without calling them such, in Example 2.16 on 
page 85 for instance. Also, emanating from Theorem 1.30 on page 49, we may expand our 
understanding by employing it as follows, some of which is adapted from [54]. 

Recall that by Corollary 3.4 on page 106, we know that hy, < oo. Also, recall from 
Application 3.1 on page 135 the definition of ¢a,, as the fundamental unit of a real quadratic 
field. 


Theorem 4.1 — Prime Representation and ho, 


Let F' be a quadratic field with discriminantA - and (wide) class number hp,. Suppose 
that p > 2 is a prime such that gcd(Ap,p) = 1 andA pf is a quadratic residue modulo p. 
Then the following hold. 


(a) If eitherA - < 0 orA p > 0 and Ne(ea,) = —1, then there exist relatively prime 
integers a,b such that 
a? = Arb? ifA FE 1 (mod 8), 
por = ae — Az b? ifA p= 0 (mod 4), 
a*+ab+4(1—Apr)b? if - =5(mod 8). 


(b) IfA 7 > 0 and Nr(e,q,) = 1, then there exist relatively prime integers a,b such that 
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+(a? = Arb?) if —— 1 (mod & 
por — +(a? = Az }?) if p= 0 (mod 4 
+(a?+ab+ (1—Ap)b?) if A=5(mod 8). 


Proof. By Theorem 1.30, since p > 2, then if (Ar/p) = 1, we have (p) = Pi P2 where P, 
are distinct prime 0 -ideals for 7 = 1,2. Thus, 


(phew) = (phew = PLP PSF ~ (1), 
since Pir ~ (1) for 7 = 1,2 by Exercise 3.18 on page 107. Hence, PP is a principal 


ideal for 7 = 1,2. Let 
hoe (: + a 
P = =— 5 


where u = v(mod 2), ifA » = 1(mod 4), and uw is even ifA - = 0(mod 4). Then via the 
proof of Theorem 1.30 we know that Pz must be the conjugate of P,, namely 


hoy u—v/Ap 
Ps — iis * 


Hence, 


2 2 
hop _ U — Apu 


so there exists an a € Up such that 


2 2 
pees (—) . 


However, 


But, by Corollary 1.12 on page 37, Op NQ=Z, so a € Uz = {+1}. Thus, 


4p'?r = +(u? — Apv’). (4.1) 


Claim 4.1 IfA 7 =0(mod 4), then ged(u/2,v) = 1, and ifA - = 1(mod 4), gcd(u,v) = 1 
or 2. 


IfA » = 1(mod 4), let g > 2 be a prime such that q | gcd(u,v). Then there exist integers 
x,y such that u = qx and v = qy, where x = y(mod 2). Therefore, by (4.1), | Apr , 
but g > 2 so q=p. Hence, 


5S +y/APr +yVJ/A 
2 * = (p) (- wy *) = Pi P2 (= 5 ‘), 


which forces Pz | pier contradicting that P, and Pz are distinct Op-ideals. We have 
shown that gcd(u,v) = 2° for some integer c > 0. It follows from (4.1) that 4° | 4soc=0 
orc=1. 

IfA » = 0(mod 4), and q is a prime such that q¢ | gcd(u/2,v), then there exist integers x, y 
such that u = 2qxz and v = qy, so 


pir = +((qv)* — (Ap/4)(qy)”) 
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which forces p = q and this leads to a contradiction as above. This is Claim 4.1. 


IfA ~ <0 then the plus sign holds in (4.1), since u? — Arv? > 0. WhenA f > 0 and there 
exists an a € Up with Nr(a@) = —1, we may multiply by 


Nr(a) = N(r+8V/ Apr) =7r? — Arps? =—-1 
to get 
—(u? — Apu?) = (r? — Aps?)(u? — Apu”) = (rut+ Apsv)? — Ap(ru + su)’. 


To complete the proof, we need only show how the a,b may be selected to satisfy parts 
(a)—(b) of our theorem. 

WhenA - = 1(mod 4), then by (4.1), if u and v are odd, 4p”®r = 0(mod 8), contradicting 
that p > 2. Thus, by Claim 4.1, gcd(u,v) = 2 so we select a = u/2 and b = v/2. If 
Ar = 0(mod 4), then by Claim 4.1, we may select a = u/2 and b = v. Lastly, whenA p = 5 
(mod 8), since u = v(mod 2), set u = b+ 2a and b = v where a,b € Z. Then (4.1) becomes, 


+4por = uy? — Apu? = (b+ 2a)? — Apb? = 4a? + 4ab4+ (1 — Ar)b?, 


so 


1 
pr =+(a? +ab4+ rie — Ar)b’), 


which secures our result. 


Remark 4.1 As a counterfoil to Theorem 4.1 on page 139, we note that, by Exercise 3.9 
on page 94, ifA is not a quadratic residue modulo a prime p > 2, then there is no binary 
quadratic form that represents p* for any positive integer k. Hence, there cannot exist 
integers (a,b,c) such that p* = ax? + bry + cy? for any integers z, y. 


Theorem 4.1 has certain value when hp, = 1. In particular, we have the following results, 
the first of which is a special case of Theorem A.27 on page 343. 


Corollary 4.1 Let p be a prime. Then there exist relatively prime integers a,b such that 
p =a’ +0? if and only if p= 2 or p=1 (mod 4). 
Proof. By Theorems 3.2 on page 92 and 3.6 on page 103, forA - = —4, 
hoy = hyy=y = 1. 


Thus, by Theorem 4.1, if (Ap/p) = 1, namely p= 1(mod 4), then p = a? +0? for a,b EN. 
Since 2 = 17+ 12, then we have one direction. Conversely, if p = a? + b?, and p > 2, then 
by Exercise 3.9 on page 94, (—4/p) = (—1/p) = 1, which implies that p = 1(mod 4). 


Corollary 4.2 Let p be a prime. Then there exist relatively prime integers a,b such that 
p =a’ + 2b? if and only if p = 2 or p=1,3 (mod 8). 


Proof. First, we know that (—8/p) = (—2/p) = 1 if and only if p = 1,3(mod 8). By 
Theorems 3.2 and 3.6, forA - = —8, 


Therefore, by Theorem 4.1, if (-8/p) = 1, p = a? + 20? for a,b E N. Also, 2 = 07 +2-1?. 
Conversely, if 
p =a’ + 2b’, and p> 2, 


then by Exercise 3.9, (—8/p) = (—2/p) = 1. 


142 4. Applications: Equations and Sieves 


Corollary 4.3 Let p be a prime. Then there exist relatively prime integers a,b such that 
p=a*+ab+ 0? if and only if p= 3 or p=1 (mod 3). 


Proof. From Exercise 4.1, (—3/p) = 1 if and only if p = 1(mod 3). By Corollaries 1.1-1.2 
on page 13, Theorem 1.28 on page 45, and Theorem 3.6 on page 103, we have that 


hgja+v=3)/2| = 1. 
Thus, by Theorem 4.1, if (Ar/p) = (—3/p) = 1, then 
p=a*+ab+ 0? for some integers a, b. 


Also 3 = 12 +1-1+1?. Conversely, by Exercise 3.9, if p > 3 and p = a? + ab + b?, then 
(—3/p) = 1. 


Corollary 4.4 Let p be a prime. Then there exist relatively prime integers a,b such that 
p =a? + 7b? if and only if p = 7 or 


p = 1,9,11,15,23,25 (mod 28). 
Proof. By Exercise 4.2, (—7/p) = 1 if and only if 
p = 1,9,11,15,23,25 (mod 28). 
Also, as in the proof of Corollary 4.3, forA - = —7, 
hoe = haat y—nyaj = h-7 = 1. 
Therefore, by Theorem 4.1, if (-7/p) =1, p= a? + 7b? for a,b EN. Also, 7 = 0?+7-1?. 


Conversely, if 
p =a? + 7b’, and p# 7, 


then by Exercise 3.9, (—7/p) = 1. 


Exercises 


4.1. Prove that (—3/p) = 1 for a prime p > 3 if and only if p = 1 (mod 3). 
(Hint: You may use (A.11) on page 342.) 


4.2. Prove that (—7/p) = 1 for an odd prime p if and only if p = 1,9, 11, 15, 23, 25 (mod 28). 


In Exercises 4.3-4.6, use the techniques of Corollary 4.3 to solve the representation prob- 
lems. 


4.3. Prove that a prime p is representable in the form 
p =a” +ab+ 30? for relatively prime a,b € Z 
if and only if 


p =1lor p=1,3,5,9, 15, 21, 23, 25, 27,31 (mod 44). 


4.1. 


4.4. 


4.5. 


4.6. 


4.7. 


4.8. 
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Prove that a prime p is representable in the form 
p =a’ +ab+5b* for relatively prime a,b € N 
if and only if p = 19 or 
p =1,5,7,9, 11, 17, 23, 25, 35, 39, 43, 45, 47, 49, 55,61, 63,73 (mod 76). 
Prove that a prime p is representable in the form 
p =a’ +ab+11b? for relatively prime a,b € Z 
if and only if p = 43 or 
p=1,9,11, 13, 15, 17, 21, 23, 25, 31, 35, 41, 47, 49, 53, 57, 59, 67, 79, 81, 


83, 87, 95, 97, 99, 101, 103, 107, 109, 111, 117, 121, 127, 133, 
135, 139, 143, 145, 153, 165, 167,169 (mod 172). 


Prove that a prime p is representable in the form 
p =a? +ab+17b? for relatively prime a,b € Z 
if and only if p = 67 or 
p = 1,9, 15,17, 19, 21, 23, 25, 29, 33, 35, 37, 39, 47, 49, 55, 59, 65, 71, 73, 77, 81, 


83, 89, 91, 93, 103, 107, 121, 123, 127, 129, 131, 135, 143, 149, 151, 153, 155, 
157, 159, 163, 167, 169, 171, 173, 181, 183, 189, 193, 199, 205, 207, 211, 215, 
217, 223, 225, 227, 237, 241, 255, 257, 261, 263,265 (mod 268). 


From Corollaries 1.1-1.2 on page 13, Theorem 1.28 on page 45, and Theorem 3.6 on 
page 103,we know that ho, = hgi(14./=163) /2] = 1. Thus, Theorem 4.1 on page 139 
informs us that odd primes p with (Ap/p) = (—163/p) = 1 satisfy that p = a? + ab+ 
41b? for some relatively prime integers a,b. Show that for b = 1, a? +a+41 is indeed 
prime for a= 0,1,...,39. 

(This is related to a result of Rabinowitsch [60], which states that for negative Ap, 
with Ap = 1(mod 4), we have that hp, = 1 if and only if x7 +2 +(1—Apr)/4 is 
prime for x = 0,1,...,||Ar|/4—1]. The reader may now go to Exercises 4.3-4.6 
and verify this fact for those values as well.) 


(See Biography 4.1 on the next page.) 


Related to the Rabinowitsch result in Exercise 4.7 is the following, known as the 
Rabinowitsch-Mollin- Williams criterion for real quadratic fields-see [46]. If F is a 
real quadratic field with discriminantA - = 1(mod 4), then |x? + x + (1 — Apr)/4| 
is 1 or prime for all x = 1,2,...,|(/Ar — 1)/2] if and only if ho, = 1 and either 
Ar =17 orA p =n? +r =5(mod 8) where r €{+ 4, 1}-see [50, Theorem 6.5.13, p. 
352]. Verify this primality for the values 


Ar € {17, 21, 29, 37, 53, 77, 101, 173, 197, 293, 437, 677}. 


(See Biography 4.2 on the following page.) 
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4.9. It is known that forA » = —20, ho, = 2 and P = (2,1+-+/—5) is an ideal representing 
the nonprincipal class. Use the identification given in the proof of Theorem 3.5 on 
page 101 to prove the following, where p # 5 is an odd prime. 

(a) p = a? + 5b? if and only if p= 1,9(mod 20). 
(b) p = 2a? + 2ab + 3b? if and only if p = 3,7(mod 20). 


Biography 4.1 The following was taken from a most interesting article about 
G. Rabinowitsch by Mordell [55]. Mordell writes: “In 1923, I attended a meet- 
ing of the American Mathematical Society held at Vassar College in New York 
State. Someone called Rainich from the University of Michigan at Ann Arbor, 
gave a talk upon the class number of quadratic fields, a subject in which I was 
very much interested. I noticed that he made no reference to a rather pretty 
paper written by Rabinowitz from Odessa and published in Crelle’s Journal. 
I commented upon this. He blushed and stammered and said, “I am Rabi- 
nowitz.” He had moved to the U.S.A. and changed his name.... The spelling of 
Rabinowitsch in this book coincides with that which appears in Crelle [60]. 


Biography 4.2 Hugh Cowie Williams was born in London, Ontario, Canada 
on July 23, 1943. He graduated with a doctorate in computer science from the 
University of Waterloo in 1969. Since that time, his research interests have been 
in using computational techniques to solve problems in number theory, and in 
particular, those with applications to cryptography. He held a Chair under 
Alberta Informatics Circle of Research Excellence (iCORE) at the University 
of Calgary (U of C) until 2009. He oversaw the Centre for Information Security 
and Cryptography (CISaC), a multi-disciplinary research centre at the U of C 
devoted to research and development towards providing security and privacy 
in information communication systems. There are also more than two dozen 
graduate students and post doctoral fellows being trained at the centre. The 
iCORE Chair is in algorithmic number theory and cryptography (ICANTC), 
which is the main funder of CISaC. The initial funding from iCORE was $3 
million dollars for the first five years and this has been renewed for another five 
years. In conjunction with this iCORE Chair, Professor Williams had set up 
a research team in pure and applied cryptography to investigate the high-end 
theoretical foundations of communications security. Previous to the iCORE 
chair, Professor Williams was Associate Dean of Science for Research and De- 
velopment at the University of Manitoba, as well as, Adjunct Professor for the 
Department of Combinatorics and Optimization at the University of Waterloo. 
He has an extensive research and leadership background and a strong interna- 
tional reputation for his work in cryptography and number theory. CISaC and 
ICANTC were acronyms coined by this author, who initiated the application 
for the Chair, and is currently a member of the academic staff of CISaC, as 
well as professor at the U of C’s mathematics department. This author and 
Professor Williams have coauthored more than two dozen papers in number 
theory, and computational mathematics, over the past quarter century. 
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4.2 Bachet’s Equation 


No enemy is worse than bad advice. 


Sophocles (c. 496—406 B.C.) 
Greek dramatist 


In this section we look at unique factorization in certain quadratic domains to find solutions 
of certain Bachet equations, those of the form 


y=atk (4.2) 
where k € Z—see Biography 4.3 on page 147. 


Theorem 4.2 — General Solutions of Bachet’s Equation 


Let F = Q(Vk) be a complex quadratic field with radicand k < —1 such that k # 1(mod 4), 
and hp, 4 0(mod 3). Then there are no solutions of (4.2) in integers x,y except in the 
following cases: there exists an integer u such that 


(k, x,y) = (41 — 3u?, 4u? + 1, € 3 $ 8u?)), 


where the + signs correspond to the + signs and ¢ = +1 is allowed in either case. 


Proof. Suppose that for k as given in the hypothesis, (4.2) has a solution. 
Claim 4.2 gcd(a, 2k) = 1. 
Given that y? = 0,1(mod 4), and k = 2,3(mod 4), then 

zg? = y? —k=1,2,3 (mod 4). 


However, x? = 2(mod 4) is not possible. Hence, x is odd. Now let p be a prime such that 


D | gcd(x, 2k), where p > 2 since x is odd. Since & is a radicand, it is squarefree, so 
pl|k = y* — 2°. (4.3) 


However, p | x so p | y, which implies that p? | (y? — x3), a contradiction to (4.3), that 
establishes the claim. 


By Claim 4.2, there exist integers r,s such that 
rx +2ks =1. (4.4) 


Claim 4.3 The 9-ideals (y+ Vk) and (y— Vk) are relatively prime. 


If the claim does not hold, then there is a prime O-ideal P dividing both of the given 
ideals by Theorem 1.19 on page 30. Therefore, by Corollary 1.7 on page 27, y+ VD € ?. 
Therefore, Wk=y t Jk (y Vk) P, so 


2Wk-Vk = 2k EP. (4.5) 


Given that 


(y + Vk)(y — Vk) = (y? —k) = (2°) = (a), 
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then by Corollary 1.7 again, since (x)? C P, then P | («)%. However, since P is prime 
P | (x), and once more by Corollary 1.7, we conclude that 


DEL. (4.6) 


Now we invoke (4.4)-(4.6) to get that both ra and 2ks are in P sol =ra+2ks € Pia 
contradiction that establishes the claim. 


By Theorem 1.26 on page 42, Op is a Dedekind domain, so by Claim 4.4 and Exercise 4.10, 
there exists an integral 0 ,-ideal J such that (y + Vk) = J°. In other words, J? ~ 1, but 
ho, # 0(mod 3), so by Exercise 4.11, J ~ 1. Thus, by Theorem 1.28 on page 45, there 
exist u,v € Z such that J = (u+vuVk). Hence, 


(y+ Vk) = (utovk) = ([u + vv’) ’ 
By Exercise 1.28 on page 19, there is a unit w in Or such that 
y+ Vk = w(u+ ov), (4.7) 


where we observe that since k # 1(mod 4), then 2 does not split in Q(Wk)—see Remark 1.24 
on page 52. Also, by Theorem 1.29 on page 47, w = +1. Now we conjugate (4.7) to get 


y—Vk = w(u—vVvk)°. (4.8) 


Hence, 
oe =y?—k=(y— Vk)(y + Vk) = w?(u + oVk)R(u — ok)? = (u? — 0?) 


Therefore, 


gu? —v?k. (4.9) 
Now by adding (4.7)—-(4.8), we get 
2y = w [(u +uVk)> + (u- oV)}] = 2w(u® + 3uv?k), (4.10) 
and by subtracting (4.8) from (4.7), we get 
2k =w [(u +uVk)> — (u- wR)] = 2wVk(3u2v + vk). (4.11) 
Hence, from (4.10)—(4.11), we get, respectively, that 
y = w(u® + 3uv?k) (4.12) 
and 
1 = w(3u7v + v®k) = we(8u? + v?k). (4.13) 


From (4.13), we get that v = tw, so from (4.9), (4.12)—(4.13), we have, 


z=wu?—k, y= w(u? + 3uk), and 1 = +(3u? + k). 


It follows that k = +1 —3u?, 2 = 4u? $1, and y = <(3u F 8u3), where ¢ = +1 is allowed in 
either case. Therefore, the two cases are encapsulated in the following 


(k, x, y) = (+1 — 3u”, 4u? + 1, €uf3 F 8u”)) 


and 


a? +k = (4u2 = 1)? +1- 3u? = 64u® + 48u4 + 9u? = (eu(3 F 8u7))? = y’, 


as required. 


As special cases, we get the following two celebrated results—see Biographies 4.4 and 4.5 
on page 148. 
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Application 4.1 —Euler’s Solution of Bachet’s Equation 


The only solutions with x,y € Z of (4.2) for k = —2 are x = 3 and y = +5. 


Application 4.2 —Fermat’s Solution of Bachet’s Equation 
The only solutions with x,y € Z of (4.2) for k = —4 are 


(x,y) € {(5, £11), (2, £2)}. 


Remark 4.2 Note that in Theorem 4.2, u is odd when k = 1 — 3u? and wu is even when 
k = —1—3u? by the hypothesis that k 4 1(mod 4), and the fact that k is a radicand, which 
precludes that & = 0(mod 4)—see Application 2.1 on page 77. 


See Exercises 4.13—4.14 for more illustrations. Also, see Exercise 4.15 for results similar 
to Theorem 4.2 on page 145 for the case where k > 0. 


Biography 4.3 Claude Gasper Bachet de Méziriac (1581-1638) was born in 
Bourg-en-Bresse in Savoy that was a region variously allied with France, Italy, 
or Spain. In his early years, he was educated by the Jesuits. Indeed, after both 
his parents died when he was only six, the Jesuit Order took care of him in a 
house belonging to the duchy of Savoy. Later, he studied with the Jesuits in 
Lyon, France, and Milan, Italy. He also spent time in Paris and Rome. His 
principal income was generated by his luxurious estate at Bourg-en-Bresse. In 
1620, he married and had seven children. By the 1630s, he developed a sequence 
of health problems including rheumatism and gout. He died on February 26, 
1638. 


Bachet’s contribution to mathematics was as a writer of books on mathematical 
puzzles, which were seminal in that later books on recreational mathematics 
were modeled after his. In 1612, for instance, he published Problemes plaisans 
et delectables qui se font par les nombres, the last edition published in 1959! 
His puzzles were largely arithmetical, such as number systems other than base 
10. Also, he was fond of card tricks, magic square problems, watch-dial puzzles 
depending on numbering schemes, and what we would call today think-of-a- 
number problems. As noted in this section, he also contributed to number 
theory, being perhaps best known for his Latin translation of Diophantus’s 
Greek book Arithmetica, in which Fermat wrote his now famous Last Theorem 
marginal notes—see Biography 4.5 on the next page. 


Exercises 


4.10. Suppose that J, J are nonzero integral R-ideals where R is a Dedekind domain with 
I and J relatively prime—see Definition 1.26 on page 29. Prove that if K is an 
R-ideal and n € N such that JJ = K”, then there exist R-ideals J, J such that 
IT=I3",J=g9", and k =Jg. 


(Hint: use Theorem 1.17 on page 28.) 
4.11. Let Or be the ring of integers of an algebraic number field F’ with class number 


hp,- Prove that if J is an integral 0 -ideal such that J” ~ 1 for some n € N with 
gcd(hp,, 2) =1, then J ~ 1. 
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4.12. 


4.13. 


4.14. 


4.15. 
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Show that the only rational integer solutions of y? = 23 —1 are x = 1 and y = 0 using 
unique factorization in Z/i]. 


Suppose that p is a prime of the form p = u? + 13v? for some u,v € N. Find all 
solutions to y? = p?” — 13, for m € N if any exist. 


(Note that 13 is the smallest value of |k| of the form |k| = 1+ 3u? such that the 
hypothesis of Theorem 4.2 is satisfied. Also, hz) =73) = 2.) 


Find all solutions of y? = x? — 193 if they exist. 


(With reference to Exercise 4.13, the next smallest |k| of the form |k| = 1+ 3u? such 
that the hypothesis of Theorem 4.2 is satisfied is |k| = 193. Also, hz y=193) = 4.) 


Suppose that k € N is a radicand of a real quadratic field F = Q(Vk) and k # 1 
(mod 4), such that hy, 4 0(mod 3), with F having fundamental unit ¢,—see Appli- 
cation 3.1 on page 135. Let ¢ = ex if e, has norm 1, and e = e? otherwise, and set 
e =T+UVk. Prove that (4.2) on page 145 has no solutions if k = 4(mod 9) and 
U = 0(mod 9). 

(Hint: Assume there is a solution (x,y) to (4.2). Then you may assume that y+Vk = 
w(ut+tuVk) for a unit w € Op and some u,v € Z, since the argument is the same 
as in the proof of Theorem 4.2.) 


(Note that more results for k > 0 of this nature, which typically involve congruences 
on T and U, may be found, for instance, in Mordell’s classic text [56] on Diophantine 
equations.) 


Biography 4.4 Leonard Euler (1707-1783) was a Swiss mathematician who 
studied under Jean Bernoulli (1667-1748)—see Biography 4.7 on page 161. 
Euler was extremely prolific. In his lifetime, he is estimated to have written 
over eight hundred pages a year. He published over five hundred papers during 
his lifetime, and another three hundred and fifty have appeared posthumously. 
It took almost fifty years for the Imperial Academy to finish publication of his 
works after his death. Euler had spent the years 1727-1741 and 1766-1783 at 
the Imperial Academy in St. Petersburg under the invitation of Peter the Great. 
Euler lost the sight in his right eye in 1735, and he was totally blind for the last 
seventeen years of his life. Nevertheless, his phenomenal memory (having the 
entire Aeneid committed to memory for example) made the difference, and so 
his mathematical output remained high. In fact, about half of his works were 
written in those last seventeen years. He died on September 18, 1783. 


Biography 4.5 Pierre de Fermat was not a professional mathematician, and 
published none of his discoveries. In fact, he was a lawyer. However, he did 
correspond with other mathematicians such as Pascal, de Bessy, and Mersenne. 
It is from this correspondence that we know about much of his work. Moreover, 
Fermat’s son found his copy of Bachet’s translation of Diophantus’ Arithmetica, 
in which he had written margin notes—see Biography 4.3 on the preceding page. 
These were published by his son, so we now have a further record of Fermat’s 
work. 


4.3. The Fermat Equation 149 


4.3 The Fermat Equation 


All animals are equal but some animals are more equal than others. 
from chapter 10 of Animal Farm (1945) 
George Orwell (Eric Blair) (1903-1950) 
English novelist 


In this section, we look at Fermat’s Last Theorem (FLT), and its related prime Fermat 
equation 

cP +yP + 2? =0. (4.14) 
It suffices to solve (4.14) in order to solve the general Fermat equation «” + y” = z” for 
n €N. As is now well-known, FLT was solved by Andrew Wiles—see [54, Theorem 10.4, 
p. 365] for a proof that is given in one paragraph at the end of the book. 
We begin with the anchor case where p = 3, provided by Gauss—see Biography 3.5 on 
page 95—then move to the larger picture provided by Kummer—see Biography 4.9 on 
page 164. The following result employs not only the unique factorization in a quadratic 
domain Z[¢3] (where ¢3 is a primitive cube root of unity) but also Fermat’s method of 
infinite descent. This method involves assuming the existence, in natural numbers, of a 
solution to a given problem and constructing new solutions using smaller natural numbers; 
and then from the new ones other solutions using still smaller natural numbers, and so on. 
Since this process cannot go on indefinitely for natural numbers, then the initial assumption 
must have been false. 


Theorem 4.3 — Gauss’s Proof of FLT for p= 3 
There are no solutions of 
ak +p +7 =0 
for nonzero a, 3,y € Or = Z[¢3], where F = Q(¢3). In particular, there are no solutions to 


3 3 3 
et+y =2Z, 


in nonzero rational integers 2, y, z. 
Proof. We assume that there are nonzero a, 3,y € Or such that 
a + +7 =0, 
and achieve a contradiction. Without loss of generality, we may assume that 


ged(a,8) = ged(a,y) = ged(B,7) = 1, 
—see Exercise 1.17 on page 6 and Remark 1.8 on page 13. Let 


Ree 

= 5 , 

—see Example 1.4 on page 2. Then Np(A) = AX’ = 3, where X’ = (3 + V—3)/2 is the 
algebraic conjugate of A. Therefore, by Corollaries 1.1—-1.2 on page 13 and Exercise 1.22 on 
page 14, \ is prime in D7. We will achieve the desired contradiction by an infinite descent 
argument. This is not done directly, but rather we get a contradiction to the equation 
a? + 63 + 393 = 0. Thus, we first show that the latter equation holds. We require two 
claims. Note that congruence of elements follows the development in 81.5 on ideals, namely 
o =w(mod v) means v | (a —w) in Or—see Remark 1.17 on page 32, as well as Exercises 
4.25-4.32 on pages 163-164 for further developments. 


h=1-G 
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Claim 4.4 If \{6 € Op, then 6 = +1(mod J). 


Let 6 = a+ b¢3, where a,b € Z. Then 6 = u+vdA, where u,v € Z. If Alu, then 6 = 0 
(mod A), a contradiction, so A { u. Since A|3, then 3 { u, so u = £1(mod 3) in Z. Thus, 
there is a t € Z such that 


=+1+4+ 3t+vX. 
But A|3, so there exists a 0 € Or such that 


6=414+toA+vA = 414 Xto+v). 


In other words, 6 = +1(mod 4), which is Claim 4.4. 


Claim 4.5 If \{ 5 € Op, then 6? = +1(mod 44). 


By Claim 4.4, we may assume that 6 = 1(mod 4) since the other case is similar. Therefore, 
6=1+A0 for some o € Dp. Thus, 


5° — 1 = (5 —1)(5 — ¢3)(6 -— G3) = Ao(Ao + 1 — C3)(Ao +1 - 3) = 


da (Ao + A)(Ao + A(1 + 3)) = AWo(o + 1)(o — 3), (4.15) 


where the last equality follows from the fact that pe, G = 0, given in Example 1.5 on 
page 2. Since 

G3 —1= (G3 +1)(Gs -—1) = (Gs +1), 
then (7 = 1(mod 4X), so since 6 = 1(mod 4), then by (4.15), 


0= (8 —1)\ 3 = o(o +1)(0 - G) = a(0 + 1)(o—-1) (mod Y). 


Hence, 


and we have Claim 4.5. 
Claim 4.6 | a7. 
Suppose that A {a 6y. Then by Claim 4.5, 


0=o07 +63 +7 =+1+1+41 (mod d*), 


from which it follows that 4 | 1 or M4 | 3. The former is impossible since » is prime, and 
the second is impossible since 


3 = (1— ¢3)(1— G3) = (1— ¢3)?(1 + Gs) = A721 + G3), 


and 1+ ¢3 is a unit, so not divisible by \?. This contradiction establishes Claim 4.6. 


By Claim 4.6, we may assume without loss of generality that A | y. However, by the gcd 
condition assumed at the outset of the proof, \ { a, and A{ B. Let n € N be the highest 
power of A dividing y. In other words, assume that 


y = A"p, for some p € Or with Af p. 


Thus, we have 
a® + 63 + 8" 93 = 0. (4.16) 
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We now use Fermat’s method of infinite descent to complete the proof. First we establish 
that n > 1. Ifn = 1, then by Claim 4.4, 


—)3p? = 08 +68 =+1+1 (mod d*). 


The signs on the right cannot be the same since » { 2. Therefore, 
—3p? =0 (mod A‘), 


forcing A | p, a contradiction that shows n > 1. Given the above, the following claim, once 
proved, will yield the full result by descent. 


Claim 4.7 If Equation (4.16) holds for n > 1, then it holds for n — 1. 


= B+ ac (8 +a) 
+ ac3 + a3 
——- ,Y = ——_, and Z = *_-. 
ae Re d 
Observe that X,Y, Z € Or by Corollary 1.1 on page 13, Equation (4.16), and the fact that 
¢3 = 1(mod A). Also, by Example 1.5, ys G3 = 0, 


X= 


X+Y+Z=0, (4.17) 


and 


BF +08 —)r 3 es 
AXAYZ= 3 = BY is = 3(=p)", 


so A3"-3 | XYZ, but \°" + XYZ, since Af p. Also, since 
B=-@X4+GY, anda=GZ-X, 
then by the gcd condition assumed at the outset of the proof, we have 
gcd(X, Y) = gced(X, Z) = gcd (Y, Z) = 1. 


Hence, each of X, Y, and Z is an associate of a cube in Dr. Also, we may assume without 
loss of generality that \°"~% | Z. By unique factorization in Or, we may let X = wé°, 
Y = uon®, and Z = usr*"—3y for some £,n,v € Or, and u; € Uo, for j = 1,2,3. 
Therefore, from (4.17), 

€3 + uan? + us r2”-3y3 = 0, (4.18) 


where uj; = uj, ‘uj for j = 4,5. Therefore, €? + ugn? = 0(mod A3). By Claim 4.5 
€3=+4+1 (mod A‘), and 7? = +1 (mod 44). 


Hence, 


1+u,=0 (mod °). 


Since the only choices for us are +1, +¢3, and +3, then the only values that satisfy the last 
congruence are u4 = +1, since A® { (4143), and \3 ¢ (+14 ¢3). If ug = 1, then Equation 
(4.18) provides a validation of Claim 4.7. If uz = —1, then replacing 7 by —7 provides a 
validation of the claim. This completes the proof. 


Theorem 4.3 is the lynchpin case for the next result. The following uses factorization in 
prime cyclotomic fields F = Q(¢,), where ¢, is a primitive p-th root of unity for a prime 
p > 2 when p{ hg, in which case p is called a regular prime. The proof is due to Kummer 
and is an application of techniques we have learned thus far. 

In the following, we note that for historical reasons and for convenience, FLT is usually bro- 
ken down into two cases. Case I is that pt xyz and Case II is that p|ayz—see Theorem 5.22 
on page 240. 
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Theorem 4.4 — Kummer’s Proof of FLT for Regular Primes—Case I 


Let p be an odd prime such that p { ho, for F = Q(G,). Then if p { xyz, the Fermat 
equation (4.14) on page 149 has no integer solution xyz 4 0. 


Proof. Assume that (4.14) has a solution x,y,z € Z with xyz #4 0. We may assume that 
x,y,z € Z are pairwise relatively prime, and we may write (4.14) as the ideal equation 


p-l 


[[(@+@y = (?. (4.19) 


j=0 
Claim 4.8 («+ Gy) and (a + Chy) are relatively prime for0 <j #k<p—1. 
Let P be a prime 0 ;--ideal dividing both of the above ideals. Therefore, P divides 
(c+ Gy) —(@+ Gy) =9Gd-G“). 


By Exercise 3.37 on page 129, \ = 1—¢, and 1 — Gok are associates for 7 # k, and by 
Exercise 3.39, ¢f is a unit, so P | (yA). By primality, P | (y) or P| (A). If P | (y), then 
P | (z) from (4.19). Since ged(y, z) = 1, there exist u,v € Z such that uy + vz = 1. Since 
y,z € P, then 1 € P, a contradiction. Hence, P | (A). By Exercise 2.24 on page 68 and 
Corollary 2.8 on page 85, 

N((\) = Ne() =p. 
Thus, by Exercise 2.45 on page 86, (A) is a prime 0 r-ideal. Therefore, P = (A), s r) | ( (z). 
By Exercise 2.46, Nr(A) | Nr(z). However, by Corollary 1.17 on page 41, No(e ve germ, 
so p = Nr(A) | z, contradicting the hypothesis. This completes Claim 4.8. 
By Claim 4.8 and Theorem 1.17 on page 28, 


(x + Cry) = I, 


for some 9 y-ideal I. Since pt ho,, then by Exercise 4.11 on page 147, I ~ 1. Hence, there 
exists an a € Dp such that 

r+ CGpy = ua’, 
where u; € Up,. By Theorem 3.18 on page 128, ui = woe for some k € Zand w € RNUp,. 
Therefore, 

e+ py = wera®. (4.20) 


By Exercise 4.32 on page 164 there exists a z1 € Z such that a = z; (mod (A)). By taking 


norms on the latter, we get 
p-1 


a? — 2f = |] (a- Gas). 


j=0 
Since ¢, = 1(mod (A)), then for each 7 = 0,1,...,p—1, 
a— Chay =a-—z, (mod ())). 


Hence, 
a? =z? (mod (A)?), 


o (4.20) becomes 
x+ Cpy = wz SS (mod (A)?). 
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However, (p) = (A)?~! by Exercise 4.19 on page 162, so 
e+ py = weiG (mod (p)). 


Since C is a unit, then 

Cp (a + Gry) = wz} (mod (p)). (4.21) 
By taking complex conjugates in (4.21), we get 

Cea + Gy) =wz) (mod (p)). (4.22) 
Subtracting (4.22) from (4.21), we get 


Co Fe 4 Bake! Cha cty =0 (mod (p)). (4.23) 


Claim 4.9 2k =1(mod p). 
If p | k, then ¢¥ = 1, so (4.23) becomes 
0= 4G -o') =u (GG - N= 9e (G- V+) =¥G AG +1) (mod (p)). 
However, by Exercise 4.20, 1+ ¢, € Up,, so 
yA =0 (mod (p)). 


Also, by Exercise 4.19, 
(p) = (AP, 
and p> 3, so A | y. Taking norms on the latter and using Exercise 2.46 again, we get that 


v) | y, contradicting the hypothesis. Therefore, k # 0(mod p). By (4.23) there exists an 
a1 € Or such that 


op=ace + 4G Sa SoG (4.24) 
By Exercise 4.21, k #1(mod p). Since k # 0,1(mod p), then 


Tk Yak KY -k-1 
a,=-—¢ oP pe pe (4.25) 


By Theorem 3.14 on page 123, 
{ls Cos eee | Go7} 


is a Z-basis of Or. Thus, if all exponents —k, 1—k, k and k — 1 are incongruent modulo 
p, then «/p € Z, contradicting the hypothesis. Thus, two of the aforementioned exponents 
are congruent modulo p. The only possibility remaining after excluding k = 0,1(mod p) is 


2k =1 (mod p). 


This establishes Claim 4.9. 
Hence, (4.24) becomes 


apCk = 2 + yp — a62* — yC3R-} = (a — y)d. 
By taking norms and applying Exercise 2.46 one more time, we get p | (a — y), namely 


x=y (mod p). 
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Thus, by (4.14) 
y =z (mod p) 


as well. Therefore, since p{ x, 


O=aP+y? +z? = 3x? (mod p). 


Thus, p = 3, which was eliminated in Theorem 4.3, so we have completed the proof. 


Now that we have completed Kummer’s verification of Case I of FLT for regular primes, 
we turn our attention to irregular primes namely those primes p such that p | ho,. We are 
interested in the number of them. Kummer stated that there are infinitely many regular 
primes. In [66], published in 1964, Siegel made this more precise by conjecturing that 
approximately e~!/? of all primes are regular, namely in the asymptotic sense using natural 
density, about 60.75% of primes are regular. However, at the time of the writing of this book, 
this still has not been proved. That there are infinitely many irregular primes is known, 
proved by K.L. Jensen in 1915, and this is the focus of our next result. The mechanism for so 
doing requires an equivalent definition of an irregular prime necessitating the introduction 
of more celebrated numbers. 


First, we need to introduce the following, which first appeared in the posthumous work Ars 
Conjectandi by Jacob (Jacques) Bernoulli in 1713—see Biography 4.7 on page 161. Also, 
the reader should be familiar with the background on the basics concerning series—see 
Appendix B. 


Definition 4.1 — Bernoulli Numbers 


In the Taylor series, for a complex variable x, 


P(e) = = OE, 


ee hae’ 


the coefficients B; are called the Bernoulli numbers. 


Example 4.1 Using the recursion formula given in Exercise 4.16 on page 161, we calculate 
the first few Bernoulli numbers: 


re Oe). Boe Be eae Bt ee | el Bes /20 0s) 
B, | 1 | -1/2 | 1/6 | 0 | -1/30 | 0 | 1/42 | 0 | —1/30 | 0 | 5/66 | 


n | 11 12 13 | 14 | 15 16 17 18 19 
B, | 0 | —691/2730 | 0 | 7/6 | O | —3617/510 | O | 43867/798 | 0 


Example 4.1 suggests that Boni; = 0 for all n € N and this is indeed the case—see 
Exercise 4.23 on page 162. 


Suppose that x,s are complex variables and set 


F(s,x) = = = 2 Bale) for |s| <2. (4.26) 
Then by comparing coefficients of x” in 
love) 5” ne co gt oo _ gf 
>. Bala) = F(s,2) = F(s)e = Do Bn Le 


we get the following. 
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Definition 4.2 — Bernoulli Polynomials 
For c € C, 


called the n-th Bernoulli polynomial. 


Example 4.2 Using the recursion formula in Exercise 4.16 again, we calculate the first few 
Bernoulli polynomials: 


1 

Ba(x) = 2 — 243 + 2? — 30° 

5 5 1 

Bs(a) = x 5t 30 F 
5 1 1 
B na 6 5 4 2 . 
6(2) = @° — 3a 52 x cas 


The following is Kummer’s notion of a regular prime which is equivalent to the one given 
on page 154. Recall that a rational number q = a/b is written in lowest terms when 
gced(a, b) = 1. 


Definition 4.3 — Bernoulli Numbers, Regular, and Irregular Primes 


An odd prime number p is said to be a regular prime if p does not divide the numerator of 
any of the Bernoulli numbers B,, when B,, is written in lowest terms for n = 2,4,6,...,p—3. 


We need the following result by Jacob Bernoulli on sums of n-th powers and Bernoulli 
polynomials. 


Lemma 4.1 — Bernoulli Numbers, Polynomials, and Sums of Powers 


For every nonnegative n € Z and k EN, 


k-1 n 
‘ Bn+1(k) = Bnai 1 m+ 1 ag 
n — n— — B;k2t1 J. 


Proof. Since F(s,x) — F(s,x — 1) = se®(®-), then 
Br4i(x) — Bn4i(x — 1) 
n+1 
Adding (4.27) for « = 1,2,...k, we get the result. 


In order to obtain a crucial result on Bernoulli numbers, which is the final lead-up to proving 
the infinitude of irregular primes, we need to Sstablish a realtionship between Bernoulli 
numbers and the Riemann zeta function 


= (x —1)". (4.27) 


ah ty for s € C with R(s) > 1 


a4 
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where #(s) = a is the real part of s = a+ b\/—1 for a,b € R—see [53, §1.9, pp. 65-72] as 
well as the development in Appendix B on pages 352-354. This was established by Euler 
as follows. 


Theorem 4.5 — Bernoulli Numbers and the Riemann Zeta Function 
For k EN, 
(27)? 
2k) = B. 
C(Qk) = opr Baal 


Proof. First we note that by putting z = 0 in Equation (4.26) and adding s/2 to both sides, 
we get (where coth denotes the hyperbolic cotangent): 


sfeet+1 Ss Ss oS sr 
= th ( ) = ee 4.2 
; ($5) BND » 2k OK)! oe 
observing that B, = —1/2 is the only nonzero, odd-indexed Bernoulli number. Then by 
setting s = 2ix in (4.28), we get 
ee i 92k 2k 
=14 1)"B 4.2 
xcot x »| )* Bak (2k)! ) (4.29) 


recalling that e’” = cosx + isinz, so cosx = (e’” + e~*”)/2 and sinx = (e*” — e~*”)/(2i). 
Secondly, from the known infinite product expansion for the sine function 


sin(x) = a Il (1 oe =) (4.30) 


—see Application B.2 on page 354—we take the logarithmic derivative of (4.30) to achieve, 


oo 2 
x 
xcot(r) =1+2 y ong? (4.31) 
n=1 


To proceed, we need the following. 


Claim 4.10 For x € C, 


Co 
Pe spoke 
Dat lge > nokqek’ 
k=1 
We have 
oo 2k oo N _97k 
x es d nT 
y =i) y (nx/x)~2* =1— lim y (=) 
nok qk N-+00 x 
k=1 k=0 k=0 


However, by Theorem B.4 on page 347 this equals 


which is Claim 4.10. 
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Now by plugging the result of Claim 4.10 into (4.31), and equating the result with (4.29), 


we get 
ber yt Bax pe x nekqek’? 


g2k—-1 2k ce 2k 2k 


z 
(2k)! PB nekqek i = = = 56 ( 2k). 


Since (—1)**1! By; > 0, then this implies the desired result, 


sO 


(—1)**" Box 


(Qr)?* _ 
[Bax = (2k) 


Corollary 4.5 For n EN, 


lim Se = 
noo n 
Proof. By Theorem 4.5, 
2(2n)! 
Bon| > ; 
|Bon| (Q0)2" 


given that ¢(2n) > 1. Since (2n)! > (2n/e)?”, by Stirling’s formula given in (A.7) on page 
339, then 


2n 
|Bon| > 2(=) ; 
Tre 


and the result follows. 


We are now ready for a key result in our pursuit to establish the infinitude of irregular 
primes. For convenience, we introduce the following notion. 


Definition 4.4 — p-Integers and Rational Congruences 


If g € Q, and p € Zis a prime, then g = a/b for a,b € Z written in lowest terms is called a 
p-integer provided that p{ b. For any n € N, a congruence 


q =q@ (mod n) with qi,q2 € Q 


means that q, — gz, written in lowest terms, is a rational number with numerator divisible 
by n. 


Remark 4.3 The term p-integer comes from the notion of a p-adic integer, which we 
will not study per se in this text since we are concentrating on a global approach—see [54, 
Chapter 6] for an introduction to p-adic analysis. 


The reader can easily verify that for any rational number q; with denominator prime to n, 
there exists a unique rational integer rg with 0 < rg <n —1 such that 


qi =T2 (mod n). 


The following result was proved independently by T. Clausen and C. von Staudt. Clausen 
was described by Gauss as a man of “outstanding talents.” The following was communicated 
to Gauss by von Staudt, who published a proof in 1840. Just prior to this, Clausen had 
published a statement of the result—see Biographies 4.6 on page 159 and 4.8 on page 162. 
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Theorem 4.6 — von Staudt—Clausen 


Let p be a prime and n € N even. If (p — 1) { n, then B,, is a p-integer. If (p — 1) | n, then 
pB, is a p-integer, and 

pB, = —1 (mod p). 
Proof. We use induction on n. Since By = 1/6, then the denominator of Bg is not divisible 
by p unless p = 2,3. If p = 3, then pBy = 1/2 is a p-integer, and pB, = 1/2 = —1(mod 3). 
If p = 2, then pBy = 1/3 is a p-integer, and pBy = 1/3 = —1(mod 2). This is the induction 
step. Now we use the fact given in Lemma 4.1 on page 155, for our case, namely 


k 
(+ 1)8.(0) = Yo (FE ") apt. 


j=0 


Therefore, 
-1 


k 
1 BAST pee 
pBy = Se(p)- -—5 D ( j ohne, (4.32) 
j=0 


where pB; for j < k is a p-integer. Consider 
Li PRED inj 
—— 4.33 
carl To, (4.38) 
which is divisible by p = 2, given that j < k, since k + 1 is odd. If p > 2, then write (4.33) 
as 
1 k+1 ny RRR Dee Gt ey 
pe eager 


where the last equality follows, via (4.33), from the symmetry property in Pascal’s triangle, 
( ei Ve (*t") see [53, Exercise 1.15, p. 14]. We have that 


k+1—-j J 
p’ | (k+1-3)!, (4.34) 
where 
ar Be Soe al i| “k+1—j7 k+1-j ~k+1-5 
r= el ee IN = < Sh 9; 
pe > pt p-1 2 J 


l=1 


with the second equality following from Theorem B.4 on page 347. Therefore, 


ps 
(k+1— 9)! 
is a p-integer, so from (4.32) and (4.34), 
kj 
P 
——— =0 dp). 
Ta eee 
Hence, pB, is a p-integer, so 
pBy = Sk(p) (mod p). (4.35) 


Also, if (p— 1) | k, then x* = 1(mod p), for 1 < « < p—1. Therefore, 


p-1 p-1 
S(p) = 5 2* => 1=p-1 (mod p), 
x=1 c=. 
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so 

Sk(p) =—1 (mod p) if (p—1) | k. (4.36) 
On the other hand, if (p — 1) { k, then let g be a primitive root modulo p. Thus, 


-Satada oe -f eo : Sj : (mod p), 


where the last equality comes from Theorem B.4 again. Therefore, since g?~! = 1(mod p) 
and g* £1(mod p), then 


S.(p) =0 (mod p) if (p—1) fk. (4.37) 
Comparing (4.35) and (4.37), we see that pB, = 0(mod p) when (p — 1) { k, so By is a 
p-integer. Similarly, comparing (4.35) and (4.36), we get that pB, = —1(mod p), when 
(p — 1) | k. 


Biography 4.6 Carl Georg Christian von Staudt (1798-1867) was born in 
the Imperial Free City of Rothenburg (now Rothenburg ob der Tauber, Ger- 
many) on January 24, 1798. He attended Gauss’s alma mater, G6ttingen, from 
1818 to 1822, the year in which he received his doctorate in astronomy from 
Erlangen, Bavaria (now Germany). In 1827, he became Professor of Mathemat- 
ics at the Polytechnic School at Nuremburg, and in 1835 at the University of 
Erlangen. One of his feats was the demonstration of how to construct a regular 
polygon of seventeen sides (a 17-gon) using only compasses. Then he turned 
his attention to Jacob Bernoulli’s numbers described above. However, he is 
principally known for his work in geometry. In 1847, he published Geometrie 
der Lage, which was on projective geometry. His work showed that projective 
geometry did not need to have reference to magnitude or number. He died on 
June 1, 1867 in Erlangen. 


Corollary 4.6 If p > 2 is prime and n € N is even with n < p—1, then 
pBn = S»(p) (mod p?). 


Proof. In the proof of Theorem 4.6, if n < p—1, then p— 1 does not divide any k < n. 
Therefore, all B, for k < n are p-integers. Hence, every term on the right-hand side of 
(4.32) is divisible by p?. 
The last result required for putting together the machinery necessary to establish the in- 
finitude of irregular primes is due to Kummer. 


Theorem 4.7 — Kummer’s Congruence 


If p is a prime and n € N is even with (p— 1){n, then B,,/n is a p-integer, and 


In this case, we say that the values B,,/n have period length p—1 modulo p when (p—1) { n 
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Proof. Let g be a primitive root modulo p with 1 < g < p, and set 


5 ae e¢ Bg" -1) 2 
POS eee gl (4.38) 


where the last equality comes from Definition 4.1 on page 154. We may also write, via 
Theorem B.4 on page 347 and the Binomial Theorem, 


F(x) = a, a;(e* — 1), (4.39) 
j=0 


where each a; is a p-integer, by (4.38). Also, since the (e* — 1)! are each linear combinations 
of the expressions: 


i’ ee (4.40) 


F(a) =a >- on, (4.41) 


where the b, are p-integers. Comparing coefficients of x” in (4.38) and (4.41), we get 


Bala” = 1) _ bn—1 


n! (n—1)! 


sO 


Since (p — 1) { n, then g” # 1(mod p), so the values g” — 1 have period length p — 1 by 
Fermat’s Little Theorem. Also, since the b, are p-integers, then B,,/n are p-integers, and 
have period length p— 1, when (p — 1) { n. 


Theorem 4.8 — Infinitude of Irregular Primes 


There exist infinitely many irregular primes. 


Proof. Let pi, p2,--: , pr be irregular primes for r € N. It suffices to prove the existence of 
an irregular prime p # p; for any j = 1,2,...,r. Let 


£ 


n=s][(;-1) =0 (mod 2), 


j=l 


where s € N may be chosen sufficiently large so that |B,/n| > 1, by Corollary 4.5 on 
page 157. Let p be a prime dividing the numerator of B,,/n, in lowest terms. If (p — 1) | n, 
then by Theorem 4.6, p divides the denominator of B,, a contradiction. Hence, (p— 1) { n, 
and pt 2 Tas Ps- Suppose that n = q(p— 1) +t, where 2 <t < p—3. By Theorem 4.7 on 
the preceding page, 

B, 

co 
Since B,/n = O(mod p), then B,/t = 0(mod p). By Definition 4.3 on page 155, p is 
irregular, and we are done. 


= (mod p). 


Bn 
nm 
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One conclusion from the results of this section and the relatively recent proof of FLT using 
elliptic curves is that the manifold attempts to prove it are far more valuable and far- 
reaching than the relevance of FLT itself. In fact, it may be said that the very existence 
of algebraic number theory itself is due to the deep and fertile ideas generated by such 
attempts to prove FLT. 


Biography 4.7 Jacob Bernoulli (1654-1705) was born on December 27, 1654 
in Basel, Switzerland. He was one of ten children of Nicolaus and Margaretha 
Bernoulli. His brother Johann (1667-1748) was the tenth child of the union, 
and the two brothers had an influence on each other’s mathematical develop- 
ment. Jacob was the first to explore the realms of mathematics, and being the 
pioneer in the family in this regard, he had no tradition to follow as did his 
brothers after him. In 1681, Bernoulli travelled to the Netherlands where he 
met the mathematician Hudde, then to England where he met with Boyle and 
Hooke. This began a correspondence with numerous mathematicians that con- 
tinued over several years. In 1683, he returned to Switzerland to teach at the 
University in Basel. He studied the work of leading mathematicians there and 
cultivated an increasing love of mathematics. Jacob’s first seriously important 
work was in his 1685 publications on logic, algebra, and probability. In 1689, 
he published significant work on infinite series and on his law of large numbers. 
The latter is a mathematical interpretation of probability as relative frequency. 
This means that if an experiment is carried out for a large number of trials, 
then the relative frequency with which an event occurs equals the probability 
of the event. By 1704, Jacob had published five works on infinite series con- 
taining such fundamental results such as that paral 1/j diverges—see Exercise 
4.17. Although Jacob thought he had discovered the latter, it had been already 
discovered by Mengoli some four decades earlier. In 1690, Jacob published an 
important result in the history of mathematics by solving a differential equa- 
tion using, in modern terms, separation of variables. This was the first time 
that the term integral was employed with its proper meaning for integration. 
In 1692, he investigated curves, including the logarithmic spiral, and in 1694, 
conceived of what we now call the lemniscate of Bernoulli. By 1696, he had 
solved what we now call the Bernoulli equation: y’ = p(x)y + q(x)y". Eight 
years after his death, the Ars Conjectandi was published in 1713, a book in 
which the Bernoulli numbers first appear—see Definition 4.1 on page 154. In 
the book, they appear in his discussion of exponential series. Jacob held his 
chair at Basel until his death on August 16, 1705, when it was filled by his 
brother Johann. Jacob was always enthralled with the logarithmic spiral men- 
tioned above. Indeed, he requested that it be carved on his tombstone with 
the (Latin) inscription I shall arise the same though changed. 


Exercises 


4.16. Prove the following recursion formula for Bernoulli numbers for n € N, 


where () is the binomial coefficient. 


(Hint: Use the fact that e® = S>*° a) 


i=0 i! 
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4.17. 


4.18. 


4.19. 


4.20. 


4.21. 
4.22. 


4.23. 


4.24. 
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Prove that )77" (1/7) diverges. 
(Hint: Assume ))°°,(1/j) =d € R and reach a contradiction.) 


Prove that, from Definition 4.2 on page 155, 


Jf 1/2 ifn=1, 
aaay={ if ifn > 1. 


(Hint: Use Exercise 4.16.) 


Biography 4.8 Thomas Clausen (1801-1885) was born in Snogebaek, Den- 
mark on January 16, 1801. Clausen took care of the livestock of a local priest, 
who in turn taught Latin, Greek, and astronomy to him. Clausen became an 
assistant at the Altona Observatory in 1824, then later he went to the Opti- 
cal Institute in Munich. His lack of any significant duties there left him with 
ample time to study mathematics and astronomy. However, his suffering from 
a degree of mental illness caused him to leave Munich and return to Altona. 
For the next two years he engaged in what many consider to be the best re- 
search of his life. In 1842, he was appointed to the observatory in Dorpat (now 
Tartu), Estonia. Then two years after that, he received his Ph.D. under the 
supervision of F.W. Bessel (1784-1846). In 1866, he was appointed director 
of the Dorpat Observatory, a post which he held until his retirement in 1872. 
During his lifetime he published more than one-hundred and fifty papers in the 
areas of mathematics, astronomy, and geophysics. Among his achievements 
was the factoring of the sixth Fermat number® in 1854 (see [71, p. 99] for a 
discussion of Clausen’s factoring method). He also found a new method for 
factoring numbers in general. He died on May 23, 1885 in Dorpat.) 


“Recall that a Fermat number is one of the form F,, = 22” +1 for any n EN. 


Let p > 2 be prime, and set A = 1 — ¢,, where ¢, is a primitive p” root of unity. 
Prove that the following ideal equation holds 


Let p > 2 be prime, and let ¢,, be a primitive pt” root of unity. Prove that 1+ Cp € Uo,, 
where F = Q(G,). 


Show that k # 1(mod p) in Claim 4.9 on page 153 of the proof of Theorem 4.4. 
Establish the following derivative formula for Bernoulli polynomials, 
Bryi(&) = (n+ 1)B,(2). 


(Hint: Replace the x by x +1 in Equation (4.27) on page 155 and differentiate with 
respect to x.) 


Prove that the Bernoulli numbers B, = 0 for n > 1 an odd integer. (Hint: Use 
Definition 4.1 on page 154.) 


Compute the Bernoulli numbers B,, for even n where 8 < n < 24. 


4.3. 


4.25. 


Note: 


4.26. 


4.27. 


4.28. 


4.29. 


4.30. 
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Let F be a number field and J a nonzero D;-ideal. In Remark 1.17 on page 32 we 
talked about congruence modulo an ideal, which we further develop here. If a,@ € J, 
we say that a and £ are congruent modulo I if a— 6 € I, denoted by 


a= (mod I). 


We call all those a € Or which are congruent to each other a residue class modulo I. 
Prove that the number of residue classes is equal to N(J). 


The balance of the exercises in this section are in reference to Exercise 4.25. 


Let R be a Dedekind domain. Prove that if gcd(a, J) = 1, then for any 8 € R, there 
is a y € R, uniquely determined modulo J, such that 


ay= (mod I). 


Furthermore, prove that this congruence is solvable for some y € 0 if and only if 


ged(a, I) | (8). 


In view of Exercise 4.26, two elements of Or that are congruent modulo J have the 
same gcd with J. Hence, this is an invariant of the class, since it is a property of the 
whole residue class. We denote the number of residue classes relatively prime to I, by 
the symbol ®(J). Let I, J be relatively prime 9 r-ideals. Prove that 


1 
#(1)=N(1[[ (1-5), 
I wa) 


where the product runs over all distinct prime divisors of J. Conclude, in particular 
that if J, J are relatively prime 0 p-ideals, then 


&(1J) = &(1) ®(J). 


Suppose that [ = ja ee where the P; are distinct Op-ideals. Prove that 


j=l 
Note that when F = Q, then ® is the ordinary Euler totient function ¢. 


Let aj; € Op for j =1...,d, and let P be a prime Op-ideal. Prove that the polynomial 
congruence 


f(z) =2¢ +ayr*! +--+ ag-1¢ +g =0 (mod P) 


has at most d solutions x € Or that are incongruent modulo P, or else f(a) = 0 
(mod ?) for all a € Or. (We also allow the case where deg(f) = 0, in which case 
f(x) = ao = O(mod P) means that ap € P.) 


Prove that the residue classes modulo J, relatively prime to J, form an abelian group 
under the multiplication given by (a+ I)(b+ JI) =ab+ TI. Prove that this group has 
order ®(J). In particular, show that if I is a prime 0 ,-ideal, then the group is cyclic. 
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4.31. Suppose that I is a nonzero D -ideal and a € Or is relatively prime to J. Prove that 
a? =1 (mod J), 
called Euler’s Theorem for Ideals. Conclude that if J = P is a prime 9 p-ideal, then 
aN()-l=1 (mod P), 
called Fermat’s Little Theorem for Ideals. 


4.32. Let P be a nonzero prime O p-ideal, and let a € DOr. Prove that there exists a z € Z 
such that a = z(mod P) if and only if a? = a(mod P), where (p) = PNZ. 


Biography 4.9 Eduard Kummer (1810-1893) was born on January 29, 1810 
in Sorau, Brandenburg, Prussia (now Germany). He entered the University of 
Halle in 1828. By 1833, he was appointed to a teaching post at the Gymnasium 
in Liegniz which he held for 10 years. In 1836, he published an important paper 
in Crelle’s Journal on hypergeometric series, which led to his correspondence 
with Jacobi and Dirichlet, who were impressed with his talent. Indeed, upon 
Dirichlet’s recommendation, Kummer was elected to the Berlin academy in 
1839, and was Secretary of the Mathematics Section of the Academy from 
1863 to 1878. In 1842, with the support of Dirichlet and Jacobi, Kummer was 
appointed to a full professorship at the University of Breslau, now Wroclaw, 
in Poland. In 1843, Kummer was aware that his attempts to prove Fermat’s 
Last Theorem were flawed due to the lack of unique factorization in general. 
He introduced his “ideal numbers” that was the basis for the concept of an 
ideal, thus allowing the development of ring theory, and a substantial amount 
of abstract algebra later on. In 1855, Dirichlet left Berlin to succeed Gauss 
at Gottingen, and recommended to Berlin that they offer the vacant chair to 
Kummer, which they did. In 1857, the Paris Academy of Sciences awarded 
Kummer the Grand Prize for his work. In 1863, the Royal Society of London 
elected him as a Fellow. He died in Berlin on May 14, 1893. 


Although Kummer may be best known for his failed attempt to prove FLT and 
the mathematics that derived from it, there are some not-so-well-known results 
that bear his name. For instance, in 1864 he published the discovery, now called 
the Kummer surface, that is a fourth order surface, based upon the singular 
surface of the quadratic line complex. This surface has sixteen isolated conical 
double points and sixteen singular tangent planes. This discovery emanated 
from his algebraic approach to geometric problems involving ray systems that 
had been studied by Sir William Rowan Hamilton (1805-1865). 
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4.4 Factoring 


The thing which is the most outstanding and chiefly to be desired by all healthy and 
good and well-off persons, is leisure with honour. 


from chapter 98 of Pro Sestio 

Cicero (Marcus Tullius Cicero) (106—43 B.C.) 
Roman orator and statesman 

—see the quotation on page 65. 


The problem of factoring rational integers has taken on significant importance in the modern 
era. To a great extent, this is due to the increased need for security in the transmission of 
sensitive data such as military or banking communications. The theory that is behind all of 
this is called cryptography, the study of methods for sending messages in secret, namely in 
enciphered or disguised form to a recipient who has the knowledge to remove the disguise or 
decipher it. The RSA cryptosystem, for instance, is based upon the presumed difficulty of 
factoring—see [51] for details on RSA and other cryptosystems. (Think of a cryptosystem, 
also called a cipher, as a method for enciphering and deciphering.) Herein we will be 
concerned with the applications of algebraic number theory to such important problems as 
factoring, but not to the cryptographic descriptions themselves, which may be found in an 
introductory text on cryptography such as [51]. 


It is somewhat surprising that long-standing problems such as Fermat’s Last Theorem have 
fallen to the sword of mathematical intellect, yet we still cannot do something as seemingly 
simple as that of factoring a 200-digit integer in reasonable computational time. However, 
this is the case. Factoring is intrinsically difficult. However, even this latter statement 
has only historical validation in the sense that a plethora of mathematicians and computer 
scientists have worked diligently to try to get efficient algorithms for factoring and, for all 
the work done, we have not advanced very far. However, there is no proof that verifies the 
intractability of factoring. 
In this section, we will look at two closely allied factoring algorithms. We first look at some 
elementary facts about factoring that will historically lead into our algorithms that are the 
feature of this section. 
@ The Integer Factoring Problem—(IFP) 
Given n €N, find primes p; for 7 = 1,2,...,r € N with p; < po <--: <p, and e; € N for 
j =1,2,...,r, such that 
cis 
n= II p;'. 
j=l 


A simpler problem than the IFP is the notion of splitting of n € N, which means the finding 
of factors r,s € N such that 1 <r < s such that n = rs. In order to solve the IFP for any 
integer, one merely splits n, then splits n/r and s if they are both composite, and so on 
until we have a complete factorization. 

Trial Division: The oldest method of splitting n is trial division, by which we mean 
dividing n by all primes up to /n. For n < 108, or within that neighbourhood, this is 
not an unreasonable method in our computer-savvy world. However, for larger integers, we 
need more elaborate methods. 


Fermat Factoring: If we have an n € N such that 


x? =y? (mod n) with 4+ y (mod n) for some z,y € Z, (4.42) 
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then n is necessarily composite since gcd(x — y,n) provides a nontrivial factor of n. This 
idea was known to Fermat who, in 1643, developed a method of factoring based upon the 
following observation. 


If n = rs is an odd natural number with 1 <r < \/n, then 
n =a? —b* where a = (r+s)/2 and b= (s—r)/2. 
Thus, in order to find a factor of n, we need only look at values x = y? — n for 


y =|V/n| +1,|/n] +2,...,(n—1)/2 


until a perfect square is found. This is called Fermat’s difference of squares method. 
Euler’s Factoring Method: This method applies only to integers of the form 


n= 2? + ay? = 27 +aw", 
where x #£ z and y £ w. In other words, n can be written in two distinct ways in this 
special form for a given nonzero value of a € Z. Then 
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(zw)? = (n — ay”)w? = —ay?w? = (z? — n)y? = (zy)? (mod n), 


from which we may have a factor of n, namely, provided that ew #+ zy(mod n). In this 
case, the (nontrivial) factors of n are given by gcd(aw + yz, n). 


The Euler method essentially is predicated on the congruence (4.42), but unlike the Fermat 
method, not all integers have even one representation in the form n = x? + ay?. 

Legendre’s Factoring Method: This method is a precursor to what we know today as 
continued fraction methods for factorization—see [51]. Legendre reasoned in the following 
fashion. Instead of looking at congruences of the form (4.42), he looked at those of the form 


a? =+py? (mod n) for primes p, (4.43) 


since a solution to (4.43) implies that +p is a quadratic residue of all prime factors of n. 
For instance, if the residue is 2, then all prime factors of n are congruent to +1 (mod 8) 
(since it is a fact from elementary number theory that 2 is a quadratic residue modulo p 
if and only if p = +1(mod 8)—see (A.10) on page 342). Therefore, he would have halved 
the search for factors of n. Legendre applied this method for various values of p, thereby 
essentially constructing a quadratic sieve by getting many residues modulo n. (A sieve 
may be regarded as any process whereby we find numbers via searching up to a prescribed 
bound and eliminating candidates as we proceed until only the desired solution set remains. 
A [general] quadratic sieve is one in which about half of the possible numbers being sieved 
are removed from consideration, a technique used for hundreds of years as a scheme for 
eliminating impossible cases from consideration.) This allowed him to eliminate potential 
prime divisors that sit in various linear sequences, as with the residue 2 example above. 
He realized that if he could achieve enough of these, he could eliminate primes up to /n, 
thereby effectively developing a test for primality. 


The linchpin of Legendre’s method is the continued fraction expansion of ./n, since he was 
simply finding small residues modulo n. Legendre was essentially building a sieve on the 
prime factors of n, which did not let him predict, for a given prime p, a different residue to 
yield a square. This meant that if he found a solution to x? = py? (mod n), he could not 
predict a solution, w? = pz*(mod n), distinct from the former. If he had been able to do 
this, he would have been able to combine them as 
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and have a factor of n provided that ew #+ pzy(mod n) since we are back to congruence 
(4.42). 

In the 1920s, one individual expanded the idea, described above, of attempting to match 
the primes to create a square. We now look at his important influence. 

Kraitchik’s Factoring Method: Maurice Kraitchik determined that it would suffice to 
find a multiple of n as a difference of squares in attempting to factor it—see Biography 4.10 
on page 173. For this purpose, he chose a polynomial of the form, kn = ax? + by?, for 
some integer /, which allowed him to gain control over finding two distinct residues at a 
given prime to form a square, which Legendre could not do. In other words, Kraitchik 
used quadratic polynomials to get the residues, then multiplied them to get squares (not a 
square times a small number). Kraitchik developed this method over a period of more than 
three decades, a method later exploited by D.H. Lehmer and R.E. Powers—see [37]). They 
employed Kraitchik’s technique but obtained their residues as Legendre had done. 


In the early 1980s, Carl Pomerance was able to fine tune the parameters in Kraitchik’s 
method described above—see [59]. We describe that process below but first need some 
notions used therein to be defined. 


An important role in factorization is played by the following notion, which we will need as 
part of the algorithm to be described. 


Definition 4.5 — Smooth Integers 


A rational integer z is said to be smooth with respect to y € Z, or simply y-smooth, if all 
prime factors of z are less than or equal to y. 


Remark 4.4 The term factor base means the choice of a suitable set of rational primes 
over which we may factor a set of integers. Also, if ¥ = {p1,po,...,px} is a factor base, 
then from knowledge about the distribution of smooth integers close to /n, the optimal k 
is known to be one that is chosen to be 


k= ¥/exp(/log(n) log log(n)). (4.44) 
Now we are ready to describe the sieve. 


Application 4.3 — The Quadratic Sieve (QS) Algorithm 


(1) Choose a factor base F = {p1,p2,...,Px}, where the p,; are primes for j = 1,2,...,k € 
N. 


(2) For each nonnegative integer 7, let t = +7. Compute 


ye =(Lvn] +t)? —n 


until & + 2 such values are found that are pz-smooth. For each such t, 


k 
ye =+][vi, (4.45) 
4=1 


and we form the binary k + 1-tuple, 


by = (vo, U1 ts V2,ty +++ ,Uk,t)s 


where v;,, is the least nonnegative residue of a; modulo 2 for 1 <i< k, vot = 0 if 
Ye > 0, and vp, = 1 if y% <0. 
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(3) Obtain a subset S$ of the values of t found in step (2) such that for each 7 = 0,1,2,...,k, 


S "viz =0 (mod 2). (4.46) 
te 
In this case, 
a ll = WE =y* (mod n), 
tes tc 


where x = |/n| + t, so gcd(x + y,n) provides a nontrivial factor of n if « #+ y 
(mod n). 


In step (2), we have that y, = x7 (mod n). Thus, if a prime p | y = 2? —n, we have 2? =n 


(mod p). Thus, we must exclude from the factor base any primes p for which there is no 
solution x € Z to the congruence 2? = n(mod p). In other words, we exclude from the 


factor base any primes p for which n is not a quadratic residue modulo p. 


Example 4.3 Let n = 60377. From Equation (4.44) on page 167, k = 13, so we choose 
the first thirteen primes for which n is a quadratic residue. They comprise our factor base 
F = {2,7, 11, 23, 29, 31, 37, 41, 53, 59, 61,67, 71}. In the table below, we see, by inspection, 
that a subset 8 of the values of ¢ such that >7,., vi. = 0(mod 2) for each i = 0,1,2,...,18 
is 8 = {—1, —3, —6, —22}. (Note that |,/n| = 245 in this case.) Thus, 

Il x? = 244? . 242? . 239? . 2237 = 50885? = x” (mod 60377), 

tS 


and 
[[ u = 2°: 7-114 - 29? - 37? = 25408? = y? (mod 60377). 
tes 


By computing both of the values, 
gcd(x — y,n) = gcd(50885 — 25408, 60377) = 349 


and 
gcd(a + y,n) = ged((50885 + 25408, 60377) = 173, 


we get that n = 60377 = 173 - 349. 


t Ly Yt wi 
—1 | 244 —29? (1,0, 0,0 
=3 | 242 | =77-37 | (1,0;0,0 
3 | 248 7? 23 (0, 0,0, 0 
—4 | 241 | -—2°-7-41 | (1,1,1,0 
4 | 249] 23.7.29 | (0,1,1,0 
—6 | 239 | —2%-11-37 | (1,1,0,1 
6: || B51 26. 41 (0, 0,0, 0 
7 \|,O52 53-59 (0, 0,0, 0 
(i010 
(0,0,1,1 
(1,0,0,0 
(0, 0,0, 0 
(1,1,0,0 
Ciara 
(0,1,0,1 


0,0, 0,0, 0,0, 0,0, 0 


0 
,0, 0,0, 1,0, 0,0,0,0,0 
1 
0 


0, 0,0, 1,0,0,0,0,0 


210: 935 | oF 7-98 
11 | 256 | 7-11-67 
—16 | 229 —28. 31 
16 | 261 2.112 
—20 | 225 | —2°. 23.53 
—22 | 223 | —2°-118 
99 D677 | “21s 31 


,9,0,0,0,0, 0,0, 0,0 


) 
) 
) 
) 
) 
) 
) 
0,0, 0,0, 0, 1, 1, 0, 0,0) 
) 
) 
) 
) 
) 
) 
) 


0 
0 
1 
0 
0, 0, 1,0, 0,0, 0,0,0,0 
0 
1 
0 
0 
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Some elementary linear algebra underlies the solution to a factorization problem using the 
QS as depicted in Example 4.3. By ensuring that there are k + 2 vectors vy in a k + 1- 
dimensional vector space ey we guarantee that there is a linear dependence relation 
among the v;. In other words, we ensure the existence of the set 8 in step (3) of the algorithm 
such that congruence (4.46) holds. There is no guarantee that « #+ y(mod n), but there 
are usually several dependency relations among the ;, so there is a high probability that 
at least one of them will yield an (x,y) pair such that 2 #+ y(mod n). The problem, of 
course, is that for “large” smoothness bounds B, we need a lot of congruences before we 
may be able to get these dependency relations. 


The first successful implementation of the QS in which a serious number was factored 
occurred in 1983 when J. Gerver [21] factored a 47-digit number. Then, in 1984, the 
authors of [16] factored a 71-digit number. 

The QS has been employed using an approach called factoring by electronic mail. This is a 
term used by Lenstra and Manasse in [40] to mean the distribution of the Quadratic Sieve 
operations to hundreds of physically separated computers all over the world, and in 1988 
they used this approach to factor a 106-digit number. Indeed, it is this parallel computing 
that picks up the time. 

In 1994, the authors of [2] factored the RSA-129 number*! by using the electronic mail 
factoring technique with over 1600 computers and more than 600 researchers around the 
globe. The unit of time measurement for factoring is called a mips year, which is defined 
as being tantamount to the computational power of a computer rated at one million in- 
structions per second (mips) and used for one year, which is equivalent to approximately 
3-10! instructions. For instance, factoring the RSA-129 challenge number required 5000 
mips years, and in 1989 the aforementioned factorization of the 106-digit number needed 
140 mips years. 

Now we are ready to present an algorithm that is closely tied to the QS, and is also a 
precursor for the number field sieve presented in §4.5. This algorithm involves factoring 
using certain cubic integers, namely the integers from 


Op = 2[Y=2| = 2/99] 
(since </—2 = —W2, which is the ring of integers of 
F = Q(v—2) = Q(V2), 


by Exercise 4.33 on page 173). In this section, we will show how we may employ these cubic 
integers in Z[\/—2] to factor integers in Z. Some of what follows is adapted from [54]. 


We begin with a motivating example. 
Example 4.4 We look at how to factor the fifth Fermat number 
Fe = 2°? +1. 
For convenience, set a = ¥/—2. First, notice that 
2Fs = x? +2, where « = 21), 


and that 
Ne(a — a) = 2° 4+ 2, with x —a € Zia]. 


4lIntegers with n digits that are a product of two primes of approximately the same size are denoted by 
RSA-n, called an RSA challenge number. 
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In fact, by Exercise 4.35 on page 173, any 8 = a+ba+ ca? has norm 
Np(8) = a3 — 2b? + 4c? + 6abe. (4.47) 


By Exercise 4.34, there is a prime 6 € Z[a] such that 8 | (a — a), so by Exercise 2.46 on 
page 86, 
Nr(8) | Np(x2 — a) = 2° +2. 


Hence, we may be able to find a nontrivial factorization of Fs via norms of certain elements 
of Z[a]. We do this as follows. 


Consider elements of the form a+ ba € Z[a], for convenience, and sieve over values of a and 
b, testing for 
gcd(Np(a + ba), Fs) = gcd(a® — 20°, Fs) > 1. 


For convenience, we let a run over the values 1,2,...,100, and b run over the values b = 
1,2,...20. Formal reasons for this approach will be given later. We fix each value of a, and 
let b run over its range of values. The runs for 1 <a< 15 and 1<b< 20 yield 


gcd(a? — 26°, Fs) = 1. 
However, at a = 16, b= 5, we get 
gcd(16? — 2.53, F;) = 641. 


Tn fact, 
Fs = 641 - 6700417. 


We may factor 16 + 5a as follows. 


16 + 5a = (1+ a)(—1+ a)(a)(—9 + 2a — a”), 
where 1+ a is a unit with norm —1; —1+ a has norm —3; a has norm —2; and 
B=-9+2a- 
has norm —641. This accounts for 
16° — 2-5? =2-3-641, 


and shows that ( is the predicted prime divisor of x— a, which gives us the nontrivial factor 
of Fs ‘ 


The method in Example 4.4 works well largely because of the small value of F5. However, 
it may not be feasible for larger values to check all of the gcd conditions over a much larger 
range. The following method of Pollard, which he introduced in 1991 in [58], uses the above 
notions of factorizations in Z[a] to factor F7, which was first accomplished in 1970. 

As in the above case, suppose that n € N with 


2n =m +2. 


For instance, 
2F, =m +2 


where m = 248. Pollard’s idea to factor n = F7 involves B-smooth numbers of the form 


a+bm, for some suitable B that will be the number of primes in a prescribed set defined 
in the algorithm below. Also, a+ ba will be B-smooth meaning that its norm is B-smooth 
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in the sense of Definition 4.5 on page 167. Thus, if we get a factorization of a + ba in 
Zla], we also get a corresponding factorization of a+ bm modulo F7. To see this, one must 
understand a notion that we will generalize when we discuss the number field sieve in 84.5. 
We let 

py: Zila] Z/nZ 


be a ring homomorphism such that ~(a) =m. Thus, in Z/nZ, 
x? = —2 = —(1+1), where 1 is the identity of Z/nZ. 
Hence, 7 is that unique map which is defined element-wise by the following. 


2 2 
yp S > za! = So zjmi € Z/nZ, where z; € Z. 
j=0 j=0 
The role of this map w in attempting to factor a number n is given by the following. 
Suppose that we have a set S of polynomials 


2 


(x) =) je! € Zila] 


j=0 
such that 
[[9@ =8 
ges 
where 6 € Z[a], and 
[sm =r. 
ges 


where y € Z. Then if (8) = a € Z, we have x? = (8)? = ¥(6?) = wv (Hes a(a)) = 
Hoes 9m) = y’ (mod n). In other words, this method finds a pair of integers «, y such that 


x? —y? =(4£—y)(x«+y)=0 (mod n), 


so we may have a nontrivial factor of n by looking at gcd(x — y,n). 


We now describe the algorithm, but give a simplified version of it, since this is meant 
to be a simple introduction to the ideas behind the number field sieve. We use a very 
small value of n as an example for the sake of simplicity, namely n = 23329. Note that 
2n = 36° + 2= m+ 2. We will also make suitable references in the algorithm in terms of 
how Pollard factored n = Fy. 


Application 4.4 — Pollard’s Algorithm 


Step 1: Compute a factor base. 


In the case of cubic integers in Z[a] = Z[\/—2], we take for n = 23329 only the first eleven 
primes as the factor base, those up to and including 41 (or for n = F7, Pollard chose the 
first five hundred rational primes) as ¥B,, the first part of the factor base, and for the 
second part, FB2, we take those primes of Z[a] with norms +p, where p € FB,. (The 
reasons behind the choice of the number of primes in FB, are largely empirical.) Also, we 
include the units —1,1+ a, and 1/(1+a) =—1+a— a? in FBg. Here, we have discarded 
the Z[a|-primes of norm p? or p*, since these cannot divide our n, given that they cannot 
divide the a + ba, with the assumptions we are making. 
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Step 2: Run the sieve. 


In this instance, the sieve involves finding numbers a+bm that are composed of some primes 
from FB,. For n = 23329, we sieve over values of a from —5 to 5 and values of b from 1 to 
10 (or for n = F7, Pollard chose values of a from —4800 to 4800, and values of b from 1 to 
2000). Save only coprime pairs (a, b). 


Step 3: Look for smooth values of the norm, and obtain factorizations of a+ bx and a+ba. 


Here, smooth values of the norm means that N = Nr(a+ ba) = a? — 2b? is not divisible by 
any primes bigger than those in ¥B,. For those (a,b) pairs, factor a + bm by trial division, 
and eliminate unsuccessful trials. Factor a + ba by computing the norm Np(a + ba) and 
using trial division. When a prime p is found, then divide out a Z[a|-prime of norm +p 
from a+ ba. This will involve getting primes in the factorization of the form a + ba + ca? 
where c # 0. Units may also come into play in the factorizations, and a table of values of 
(1+ a) is kept for such purposes with 7 = —2,--- ,2 for n = 23329 (or for F7, one should 
choose to keep a record of units for 7 = —8,—7,...,8). Some data extracted for the run on 
n = 23329 is given as follows. 


a+ ba + ca? N factorization of a + ba + ca? 
5+a 3-41 (—1+ a)(—1 — 2a — 2a”) 
4+ 10a —2*.11? | —(3 + 20)?a4(-1+ a —- a7)? 
-l+a —3 —-l+a 
Pelee ian oee'|| aT -1- 2a -— 20? 
3+ 2a 11 3+ 2a 
a —2 a 
—-1l+a-— a’ -1 unit 
a+bm+cm? factorization of a + bm + cm? 
o+m 41 
4+ 10m 2?.7-13 
-l+m 5-7 
Table S| |x oa a AGA 
3+2m 3-5? 
m 2? . 3? 
-1+m-m? —13-97 
Step 4: Complete the factorization. 
By selecting —1 times the first four rows in the third column of Table 4.1, we get a square 


in Z[a]: 


6? = (-14+ a@)?(—1 — 2a — 207)?(3 + 2a)?a4(-1 + a — 7)’, (4.48) 


and correspondingly, since 8? is also —1 times the first four rows in the first column of Table 
4.1, we get: 


6? = (5+ e)(—4—10a)(—1+.a)(—1 — 2a — 207). (4.49) 
Then we get a square in Z from Table 4.2 by applying w to (4.49): 


#(87) = (5 +m)(—4 — 10m)(—1 + m)(—1 — 2m — 2m?) = 2? . 5? . 7? - 13? - 41? = y?. 


Also, by applying ~ to @ via (4.48), we get: 


w(8) = (—1+m)(—1 — 2m — 2m?)(3 + 2m)m?(—1+ m — m?) = 9348 (mod 23329), 
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so by setting x = (3), we have 
2? = ¥2(B) = 4(B2) = 4? (mod n), 


Since 
y =2-5-7-13-41 = 13981 (mod 23329), 


then y — « = 4633 (mod 23329). However, gcd(4633, 23329) = 41. In fact 23329 = 41 - 569. 


Pollard used the algorithm in a similar fashion to find integers X and Y for the more serious 
factorization gcd(X — Y, Fy) = 59649589127497217. Hence, we have a factorization of F7 
as follows. 

Fr, = 59649589127497217 - 5704689200685129054721. 


Essentially, the ideas for factoring using cubic integers above is akin to the notion of the 
strategy used in the QS method. There, we try to generate sufficiently many smooth 
quadratic residues of n close to \/n. In the cubic case, we try to factor numbers that are 
close to perfect cubes. In $4.5, we will extend these ideas to show how Fo was factored 
using the number field sieve, and Z[,/2]. 


Exercises 
4.33. Prove that Z[\/—2] is the ring of integers of Q(</—2). 
4.34. Prove that every nonzero ideal in a Dedekind domain R must contain a prime element. 
4.35. Prove that (4.47) holds in Example 4.4. 
4.36. Use Pollard’s method to factor Fg. 


In Exercises 4.87-4.89, use the gcd method described before Pollard’s method to find an odd 
factor of the given integer. 


4.37, 57 —1, 
4.38. 714941. (Hint: Use Z[x/—7].) 


4.39. 3739 — 1. (Hint: Use Z[/3].) 
Factor each of the integers in Exercises 4.40-4.48 using the QS method. 


4.40. n = 3191491. 

4.41. n = 12358397. 
4.42. n = 42723991. 
4.43. n = 74299271. 


Biography 4.10 Maurice Borisovich Kraitchik (1882-1957) obtained his 
Ph.D. from the University of Brussels in 1923. He worked as an engineer 
in Brussels and later as a Director at the Mathematical Sciences section of the 
Mathematical Institute for Advanced Studies there. From 1941-1946, he was 
Associate Professor at the New School for Social Research in New York. In 
1946, he returned to Belgium, where he died on August 19, 1957. His work 
over thirty-five years on factoring methods stands tall today because he devised 
and used a variety of practical techniques that are found today in computer 
methods such as the QS method. He is also the author of the popular book 
Mathematical Recreations [34]. 
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4.5 The Number Field Sieve 


When fortune is lavish of her favours, beware of adversity; events do not always 
succeed each other in one train of fortunes. 

Cato the elder (Marcus Porcius Cato) (234 B.C.—149 B.C.) 

Roman statesman, orator, and writer 


In 84.4 we provided a motivator for the sieve in this section via Pollard’s algorithm, which 
we showed to be linked to the QS. Some of what follows is adapted from [54]. 


In 1988, John Pollard circulated a manuscript that contained the outline of a new algorithm 
for factoring integers, which we studied in §4.4. In 1990, the first practical version of 
Pollard’s algorithm was given in [39], published in 1993, the authors of which dubbed it 
the number field sieve. Pollard had been motivated by a discrete logarithm algorithm given 
in 1986, by the authors of [13], which employed quadratic fields. Pollard looked at the 
more general scenario by outlining an idea for factoring certain large integers using number 
fields. The special numbers that he considered are those large composite natural numbers 
that are “close” to being powers, namely those n € N of the form n = r* — s for small 
natural numbers r and |s|, and a possibly much larger natural number ¢t. Examples of such 
numbers, which the number field sieve had some successes factoring, may be found in tables 
of numbers of the form 


n=r' +1, called Cunningham numbers. 


However, the most noteworthy success was factorization of the ninth Fermat number Fy = 
27 +1 = 2°!241 (having 155 decimal digits), by the Lenstra brothers, Manasse and Pollard 
in 1990, the publication of which appeared in 1993—see [41]. 


To review some of the history preceding the number field sieve, we observe the following. 
Prior to 1970, a 25-digit integer was considered difficult to factor. In 1970, the power of the 
continued fraction method raised this to 50 digits—see [53, $5.4, pp. 240-242]. Once the 
algorithm was up and running in 1970, legions of 20- to 45-digit numbers were factored that 
could not be factored before. The first major success was the factorization of the seventh 
Fermat number 

Fp = 27" 41= 228 41, 


a 39-digit number, which we described via Pollard’s method in §4.4. By the mid 1980s, the 
quadratic sieve algorithm was felling 100-digit numbers. With the dawn of the number field 
sieve, 150-digit integers were now being tackled. The number field sieve is considered to 
be asymptotically faster than any known algorithm for the special class of integers of the 
above special form to which it applies. Furthermore, the number field sieve can be made to 
work for arbitrary integers. For details, see [7], where the authors refer to the number field 
sieve for the special number n = r‘ — s as the special number field sieve. The more general 
sieve has come to be known as the general number field sieve. 


Much older than any of the aforementioned ideas for factoring is that attributed to Fermat, 
namely the writing of n as a difference of two squares. However, this idea was enhanced 
by Maurice Kraitchik in the 1920s, both approaches we also reviewed in $4.4. To further 
describe Kraitchik’s influence, we review it from a slightly different perspective here. He 
reasoned it might suffice to find a multiple of n as a difference of squares, namely, 


x? =y* (mod n), (4.50) 
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so that one of x—y or x+y could be divisible by a factor of n. We say could here since we fail 
to get a nontrivial factor of mn when x = ty(mod n). However, it can be shown that if n is 
divisible by at least two distinct odd primes, then for at least half of the pairs « (modulo n), 
and y (modulo n), satisfying (4.50) with gcd(x, y) = 1, we will have 1 < gcd(a—y,n) <n. 
This classical idea of Kraitchik had seeds in the work of Gauss, but Kraitchik introduced it 
into a new century in the pre-dawn of the computer age. This idea is currently exploited 
by many algorithms via construction of these (x, y)-pairs. For instance, the QS algorithm 
uses it. More recently, the number field sieve exploits the idea. To see how this is done, we 
give a brief overview of the methodology of the number field sieve. This will motivate the 
formal description of the algorithm. 


For n = rt —s we wish to choose a number field of degree d over Q. The following choice for 
d is made for reasons (which we will not discuss here), which make it the optimal selection, 
at least theoretically. (The interested reader may consult [39, Sections 6.2-6.3, pp. 31-32] 
for the complexity analysis and reasoning behind these choices.) Set 


1))1 1/3 
d= (3 + o(1)) logn : (4.51) 
2loglogn 
Now select k € N, which is minimal with respect to kd > t. Therefore, r*4 = sr*4-t 
(mod n). Set 
m=r*,and c= sr***, (4.52) 
Then m4 = c(mod n). Set 
f(z) _ et — 6, 


and let a € C be a root of f. Then this leads to a choice of a number field, namely 
F = Q(a). Although the number field sieve can be made to work when Zla] is not a 
UFD, the assumption that it is a UFD simplifies matters greatly in the exposition of the 
algorithm, so we will make this assumption. Note that once made, this assumption implies 
that Or = Zia}. See [39] for a description of the modifications necessary when it is not a 
UFD. 

Now the question of the irreducibility of f arises. If f is reducible over Z, we are indeed 
lucky, since then f(x) = g(x)h(x), with g(x), h(x) € Z[x], where 0 < deg(g) < deg(f). 
Therefore, f(m) =n = g(m)h(m) is a nontrivial factorization of n, and we are done. Use of 
the number field sieve is unnecessary. However, the probability is high that f is irreducible 
since most primitive polynomials over Z are irreducible. Hence, for the description of the 
number field sieve, we may assume that f is irreducible over Z. 


Since f(m) = 0(mod n), we may define the natural homomorphism, 
wy: Zila] Z/nZ, 


given by 
anm € Z/nZ. 


Then 
w S¢ aja! = Seam. 
j J 


Now define a set 8 consisting of pairs of relatively prime integers (a,b), satisfying the 
following two conditions: 
[[ @tom)=e, (ceZ), (4.53) 
(a,b)ES 
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and 
Il (a+ba) = B*, (8 €Zla)). (4.54) 
(a,b)ES 
Thus, (8?) = @, so (6?) = c? (mod n). In other words, since w(67) = 7(8)?, then if we 
set w(8) =h € Z, h? = c? (mod n). This takes us back to Kraitchik’s original idea, and we 
may have a nontrivial factor of n, namely gcd(h + c,n) (provided that h #+ c(mod n)). 


The above overview of the number field sieve methodology is actually a special case of an 
algebraic idea, which is described as follows. Let R be a ring with homomorphism 


@:R>Z/nZ x Z/nZ, 


together with an algorithm for computing nonzero diagonal elements (x,) for « € Z/nZ. 
Then the goal is to multiplicatively combine these elements to obtain squares in R whose 
square roots have an image under ¢ not lying in (x, +x) for nonzero « € Z/nZ. The number 
field sieve is the special case 


R=Z xZlal, with 4(z,8) = (Z,0(8)). 


Before setting down the details of the formal number field sieve algorithm, we discuss the 
crucial role played by smoothness introduced in Definition 4.5 on page 167. Recall that a 
smooth number is one with only “small” prime factors. In particular, n € N is B-smooth 
for B € R*, if n has no prime factor bigger than B. Smooth numbers satisfy the triad of 
properties: 


(1) They are fairly numerous (albeit sparse). 
(2) They enjoy a simple multiplicative structure. 


(3) They play an essential role in discrete logarithm algorithms. 
If F = Q(q) is a number field, then by definition 
an algebraic number a + ba € Z[a] is B-smooth if |Nr(a + ba)| is B-smooth. 


Hence, a+ ba is B-smooth if and only if all primes dividing |Nr(a+ ba)| are less than B. 
Thus, the idea behind the number field sieve is to look for small relatively prime numbers 
a and 6 such that both a+ ab and a+ mb are smooth. Since (a+ ab) = a+ mb, then 
each pair provides a congruence modulo n between two products. Sufficiently many of these 
congruences can then be used to find solutions to h? = c?(mod n), which may lead to a 
factorization of n. 


The above overview leaves open the demanding questions as to how we choose the degree 
d, the integer m, and how the set of relatively prime integers a,b such that Equations 
(4.53)—(4.54) can be found. These questions may now be answered in the following formal 
description of the algorithm. 


Application 4.5 — The Number Field Sieve Algorithm 


Step 1—Selection of a Factor Base and Smoothness Bound 


There is a consensus that smoothness bounds are best chosen empirically. However, there 
are theoretical reasons for choosing such bounds as 


B = exp((2/3)?/*(log n)"/ (log log n)”/*), 
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which is considered to be optimal since it is based upon the choice for d as above. See [39, 
Section 6.3, p. 32] for details. Furthermore, the reasons for this being called a smoothness 
bound will unfold in the sequel. 


Define a set § = 8; U 82 U 83, where the component sets 8; are given as follows. 8; = {p € 
Z:p is prime and p < B}, 


82 = {uj : 7 =1,2,...,71 +r2—1, where u, is a generator of Up, }. 


(Here {r1,r2} is the signature of F’, and the generators u; are the generators of the infinite 
cyclic groups given by Dirichlet’s Unit Theorem that we presented as Theorem 3.20 on 
page 135.) Also, 


83 = {8 =a+ba € Zia] : |Nr(G)| = p < Bg where p is prime}, 
where Bz is chosen empirically. Now we set the factor base as 
F = {aj = WJ) € Z/nZ: 9 € $}. 


Also, we may assume gcd(a,;,n) = 1 for all j € 8, since otherwise we have a factorization of 
n and the algorithm terminates. 


Step 2—Collecting Relations and Finding Dependencies 


We wish to collect relations (4.53)—(4.54) such that they occur simultaneously, thereby 
yielding a potential factor of n. One searches for relatively prime pairs (a,b) with b > 0 
satisfying the following two conditions. 


(i) |a + bm| is B-smooth except for at most one additional prime factor p,, with B < 
pi < B,, where B, is empirically determined. 


(ii) a + ba is By-smooth except for at most one additional prime 6 € Z[a] such that 
|Nr(B)| = po with By < po < Bs, where B3 is empirically chosen. 


The prime p; in (i) is called the large prime, and the prime pe in (ii) is called the large 
prime norm. Pairs (a,b) for which p; and po do not exist (namely when we set p; = po = 1) 
are called full relations, and are called partial relations otherwise. In the sequel, we will 
only describe the full relations since, although the partial relations are more complicated, 
they lead to relations among the factor base elements in a fashion completely similar to the 
ones for full relations. For details on partial relations, see [41, Section 5]. 

First, we show how to achieve relations in Equation (4.53), the “easy” part (relatively 
speaking). (This is called the rational part, whereas relations in Equation (4.54) are called 
the algebraic part.) Then we show how to put the two together. To do this, we need the 
following notion from linear algebra. 

Every n € N has an exponent vector v(m) defined by n = []j2, DP; where p; is the j*” 
prime, only finitely many of the v; are nonzero, and 


v(n) = (v1, V2, - : ) = (vj) F241 


with an infinite string of zeros after the last significant place. We observe that n is a square 
if and only if each v; is even. Hence, for our purposes, the v; give too much information. 
Thus, to simplify our task, we reduce each v; modulo 2. Henceforth, then 0; means v; 
reduced modulo 2. We modify the notion of the exponent vector further for our purposes 
by letting By, = 7(B), where 7(B) is the number of primes no bigger than B. Then, with 
po = —1,a+bm=]]j25 p; is the factorization of a + bm. Set 


v(a + bm) = (%,..-,UB,); 
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for each pair (a,b) with a+ ba € 83. The choice of B allows us to make the assumption 
that |S3| > B, +1. Therefore, the vectors in v(a + bm) for pairs (a,b) with a + ba € 83 
exceed the dimension of the F2-vector space ne 1+! Tn other words, we have more than 
B,+1 vectors in a B, + 1-dimensional vector space. Therefore, there exist nontrivial linear 
dependence relations between vectors. This implies the existence of a subset J of 83 such 
that 

S- v(a+ bm) =06 FP*1, 

at+baceT 

so 

[] @tom)=2 (eZ). 


at+tbaEeT 
This solves Equation (4.53). 


Now we turn to the algebraic relations in Equation (4.54). We may calculate the norm 
of a + ba by setting x = a and y = b in the homogeneous polynomial (—y)4f(—2/y) = 
x? — c(—y)*, with f(x) = 24 — c. Therefore, Nr(a+ ba) = (—b)4f(—ab-1) = at — e(—b)?. 
Let 

Rp ={r¢€Z:0<r<p-1, and f(r) =0 (mod p)}. 


Then for gcd(a,b) = 1, we have Ne(a + ba) = 0(mod p) if and only if a = —br(mod p), 
and this r is unique. Observe that by the relative primality of a and 6, the multiplicative 
inverse b~! of b modulo p is defined since, for b = 0(mod p), there are no nonzero pairs 
(a,b) with Ne(a + ba) = 0(mod p). 

The above shows that there is a one-to-one correspondence between those 8 € Zia] with 
|Nr(G)| =p, a prime and pairs (p,r) with r € Rp. Note that the kernel of the natural map 
w: Zia] Z/pZ is ker(w) = (a + ba), the cyclic subgroup of Z[a] generated by a+ ba. It 
follows that |Z[a] : (a + ba)| = |Nr(a + ba)| = p, so Zla}/(a + ba) is a field. 

This corresponds to saying that the Z[a]-ideal P = (a+ ba) is a principal, first-degree prime 
Z{a]-ideal, namely one for which Np(P) = p' = p. Hence, Z[a]/P = Fy, the finite field of p 
elements. 

The above tells us that in Step 1 of the number field sieve algorithm, the set 53 essentially 
consists of the first-degree prime Z[a]-ideals of norm Np(P) < Bz. These are the smooth, 
degree one, prime D -ideals, namely those ideals whose prime norms are Bj-smooth. 


In part (ii) of Step 2 of the algorithm on page 177, the additional prime element 8 € Z[a] 
such that |Nr(G)| = po with By < pa < Bs corresponds to the prime O-ideal Pz called 
the large prime ideal. Moreover, Pz corresponds to the pair (p2,c(mod p2)), where c € Z is 
such that a = —bc(mod pz), thereby enabling us to distinguish between prime ideals of the 
same norm. If the large prime in Step 2 does not occur, we write Pz = (1). Now, since 


|a+bm| = [[ pv”, 
peEsi 


and 


la + ba| = II um II Ste (4.55) 


ue S82 seES3 


for nonnegative t,,v; € Z, and since w(a + bm) = (a+ ba), then 


I] v™” = [[ v@" [L vo”. 


pes, UeS82 sES83 
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in Z/nZ. Therefore, we achieve a relationship among the elements of the factor base F, as 


follows 
IT e@® TL 4” = [] ¥@) (mod n). (4.56) 
UweS82 sE83 pes, 

Furthermore, we may translate (4.55) ideal-theoretically into the ideal product 


Ja+ bol = [J uw’ TT a. (4.57) 


u€82 PES3 


where ? ranges over all of the first-degree prime Z[a]-ideals of norm less than Bz, and mp 
is a generator of P. 


Thus, (4.56) gives rise to the identity 


I] v@)” = Tf ve TL oo)”. 


pes, ueS82 PES3 


If |S3| > 7(B), then by applying Gaussian elimination for instance, we can find x(a,b) € 
{0,1} such that simultaneously 


2 
II (a + bay?) = (( II *) (11 “)) , 
a+ba€$3 UES sE83 


2 


T] tom = [[o?] ] . 


atba€$83 pes, 


and 


hold. From this a factorization of n may be gleaned, by Kraitchik’s method. 

Practically speaking, the number field sieve tasks consist of sieving all pairs (a,b) for b = 
by, b2...,b, for short (overlapping) intervals [b1,b2], with |a| less than some given bound. 
All relations, full and partial, are gathered in this way until sufficiently many have been 
collected. 


The big prize garnered by the number field sieve was the factorization of Fo, the ninth 
Fermat number, as described in [41]. In 1903, A.E. Western found the prime factor 
2424833 = 37-216 +1 of Fy. Then in 1967, Brillhart determined that Fo/2424833 (having 
148 decimal digits) is composite by showing that it fails to satisfy Fermat’s Little Theorem. 
Thus, the authors of [41] chose 


n = Fo/2424833 = (2°17 + 1) /2424833. 


Then they exploited the above algorithm as follows. If we choose d as in Equation (4.51) 
on page 175, we get that d = 5. The authors of [41] then observed that since 2°! = —1 
(mod n), then for h = 27°, we get h® = 210° = 2. (2512)? = 2(mod n). This allowed them 
to choose the map w : Z[¥/2] 1 Z/nZ, given by w : /2 +> 279. Here Z[*/2] is a UFD. 
Then they chose m and c as in Equation (4.52), namely since r = 2, s = —1, and t = 512, 
then the minimal k with 5k = dk > t = 512 is k = 103, and m = 219, so c = —8 = 25103 
(mod n). This gives rise to f(x) = 2° +8 with root a = =4/2". and Za] C Z[/2]. Observe 
that 8Fy = 2515 +g = (2193)° 4.8. Thus, b(a) = m = 2103 = —2615 = — (2205)° (mod n). 
Notice that 21° is small in relation to n, and is in fact closer to ~/n. Since 


v(a+ ba) = at 2'b € Z/nZ, 


180 4. Applications: Equations and Sieves 


we are in a position to form relations as described in the above algorithm. Indeed, the 
authors of [41] actually worked only in the subring Z[a] to find their relations. The sets 
they chose from Step 1 are 8; = {p € Z: p < 1295377}, 


a oe 
So = {-1,-14 ¥2,-14+ 72 — ¥2 + 9%}, 
for units uy = —1, w= —-1+ ¥/2, and uz = —1+ w/e _ 3/2? + 2, and 
83 = {6 € Zia] : |Nr(G)| = p < 1294973, p a prime}. 


The authors began sieving in mid-February of 1990 on approximately thirty-five worksta- 
tions at Bellcore. On the morning of June 15, 1990 the first of the dependency relations 
that they achieved turned out to give rise to a trivial factorization! However, an hour 
later their second dependency relation gave way to a 49-digit factor. This and the 99-digit 
cofactor were determined by A. Odlyzko to be primes, on that same day. They achieved: 
Fo = q7 - Gag: G99, Where q; is a prime with j decimal digits as follows: q7 = 2424833, 


dag = 7455602825647884208337395736200454918783366342657, 
and ggg = 741640062627530801524787141901937474059940781097519 
023905821316144415759504705008092818711693940737. 


Fermat numbers have an important and rich history, which is intertwined with the very 
history of factoring itself. Euler was able to factor F5. In 1880, Landry used an idea 
attributable to Fermat to factor Fg. As noted above, Fy was factored by Pollard. Brent 
and Pollard used a version of Pollard’s rho-method to factor Fg (see [53, pp. 206-208] for 
a detailed description with examples of the rho-method). As we have shown above, F5 was 
factored by the number field sieve. Lenstra’s elliptic curve method was used by Brent to 
factor Fig and F\;—see [52, pp. 522-524]. Several other Fermat numbers are known to 
have certain small prime factors, and the smallest Fermat number for which there is no 
known factor is Fy4. On March 27, 2010 Michael Vang found the sixth known factor of 
Fy2: 17353230210429594579133099699123162989482444520899 - 2/5 +1. On March 26, 2010 
David Bessell found the factor of F2: 3853959202444067657533632211 - 224 +1. No factor 
of the 1262612-digit F2 was previously known. On February 3, 2010 Tapio Rajala found 
the factor of F,4: 1784180997819127957596374417642156545110881094717 - 24° + 1. For 
updates on prime factors of Fermat numbers, see the website: 


http://www.prothsearch.net /fermat.html. 


Exercises 


4.44. Let n,d € N and m = |n'/4|, with n > 2”. Write n to base m via integers c; € 

{0,1,...,m—1} for 7 = 1,2,...,d, namely 

d 
n= S- cm =coteam+-.-+ ca_im?! + cam?. 
j=0 
Prove that cg = 1, and cg_1 < d. (The polynomial 
f@) = 49 + caaal! poo tidy 
is the polynomial used in the general number field sieve. See [7].) 


4.45. Use the number field sieve to find two prime factors of 2!°° + 3. 
4.46. Use the number field sieve to find a prime factor of 248° + 1. 


Chapter 5 


Ideal Decomposition in Number 
Fields 


At his best, man is the noblest of all animals; separated from law and justice he is the 
worst. 

Aristotle (384-322 B.C.) 
Greek philosopher 


This chapter builds upon the ideas developed for quadratic fields in Theorem 1.30 on 
page 49 and the discussion surrounding it. We extend the notions and definitions given 
in Remark 1.24 on page 52 to arbitrary number fields and link this with the Galois theory 
developed in §2.1. 


5.1 Inertia, Ramification, and Splitting of Prime Ideals 


If K/F is an extension of number fields, namely | : F'| < oo, and |F': Q| < «0, we call Ka 
relative extension of F. If F = Q, then K is called an absolute extension. Our main interest 
continues to be the number rings, so we now look at the interplay among the ideals of Op 
and those of 9x. We remind the reader of the notation for the class group and discussion 
surrounding it in Remark 3.7 on page 100. Since D7 C Ox, we may consider the map 


LK/F : Ia, > Inc, 


given by 

Ukj/p: JH IDK, (5.1) 
where JOx is the smallest fractional O,-ideal containing J. This consists of all sums 
vr, aj 8; with n € N, a; € J, and 8; € Ox for j = 1,2,...,n. This is also called the 


j 
fractional ideal generated by J in Ox. It follows from Theorem 1.17 on page 28, that 


where the P; are distinct, prime 0 ,-ideals, and e; € Z are nonzero, and possibly negative 
for 7 = 1,2,...,r. By Exercise 5.1 on page 194, 


JOnNnF =4, 
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and by Exercise 5.2, 4p is a group monomorphism that induces a mapping?! 
Ux/F : Cop +> Cox, (5.2) 


given by 
CK/F: In kee 


j=l 


Remark 5.1 We are mainly interested in the case where J is a prime O--ideal and its 
decomposition in extension fields, since the prime ideals are the generators of the class group 
as demonstated in Remark 3.7. 


Definition 5.1 — Ramification, Inertia, and Decomposition Numbers 


Let K/F be an extension of number fields, and let p be a prime D-ideal with 
g 
pOx = ie e; EN 
j=l 


where the P; are distinct, prime Ox-ideals. We say that the prime Ox-ideals P; lie over 
p, or are above p. Also, p is said to lie under the P;. 


The number e; is called the ramification index of P; in Ox, denoted by 
ex/F(P;). 


Also, P; is said to be ramified in Ox if ex;r(P;) > 1, and p is also said to be ramified in 
Ox as well. Furthermore, p is said to be unramified in O~ provided that ex/p(P;) = 1 for 
each j = 1,2,...,g. The number g is called the decomposition number of p in Ox, denoted 
by 

9K/F(P): 
The degree |Ox%/P; : Or/p| is called the inertial degree, or relative degree, of P; in Ox, 
denoted by 

fer (P5). 
The fields Ox /P; and O-/p are called the residue class fields or simply residue fields at 
P; and p, respectively. Thus, fx/r(P;) is the degree of the extension of these finite fields. 


A useful fact that we will need in what follows is the next result using the above notions. 


Lemma 5.1 Let K/F be an extension of number fields and let P be a prime 9x-ideal, 
then there exists exactly one 0 r-ideal p lying below P. 


Proof. Since 1 ¢ PN Op, then PN O- is an O-p-ideal with Op 4 PNA OP, and PA O- is 
nonzero since Nx /p(a) € PA Of for all a € P. Also, given that PN Op C P, then this 
induces an embedding 


w: Or/(PNOF) + OK/P, 


5.1The term induces here may be interpreted as “gives rise to,” which means that the mapping in (5.1) 
gives rise to the well-defined mapping in (5.2) by moving to quotient groups. 
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and since Dx /P is a field by Theorems 1.11 on page 18 and 1.26 on page 42, then as a 
subring embedded in it, Or/(PMO-r) must be an integral domain by Theorem 1.9 on 
page 17, so PN Of is a prime O--ideal. Since 


tKjrF(PN OF) = (PADF)OK & POKRANDK = PR, 


then P lies over PM Op. If p is another prime O--ideal below P, then p C PND =z, so 
p = POOF, by Condition B of Definition 1.23 on page 25. 


Example 5.1 Let us consider the ideals in Example 2.14 on page 84. We have the Dx = 
Z[V10]-ideal (2)Z[V10] = P? where P = (2,10), so the prime ideal (2) in Or = Z is 
ramified in Ox. Since 


(3)Ox = (3,1 + V10)(3, 1 — V10) = PP’, 


then the ramification indices of P and P’ are 1, so 3 is unramified in Ox. Its decomposition 
number is 2. Lastly, (7)Ox = P a prime Ox-ideal since |Ox/P : Or/(7)| = 2, its inertial 
degree in Kk. 


There is an easier way to determine the relative degrees of primes in extensions via poly- 
nomials in certain circumstances by way of Exercise 5.4. 


Example 5.2 Let K = Q(W2) = Q(a) and F = Q. Then by Exercise 4.33 on page 173, 
On = Z[x/2]. For p = 7, we have that 


g—-2= Mea,o(£) 


is irreducible modulo 7. Therefore, (7)0«% = P, where P is an Ox-prime ideal with 
ex/F(P) =1 = g9x/r(7) and fx/r(P) = 3, so 7 is inert in K by Exercise 5.4. 
If p = 29, then 

g® —2= (a+ 3)(x? + 262 — 20) (mod 29), 


where x? + 262 — 20 is irreducible modulo 29 so by Exercise 5.4, 
(29 DK = Pi Po, 


where the fxjr(P1) = 1, and fxjr(P2) = 2, €x/F(P1) = €x/F(P2) = 1, and 9K /F(29) =2. 
Thus, 29 is unramified in Dx. 
If p= 31, then 

x? —2=(x2—4)(2—7)(2+11) (mod 31), 


so by Exercise 5.4, 
(31)DK = Pi PoPs, 


where expr (Pj) = fyr(Pj) = 1 for 7 = 1, 2,3, and gx/r(31) = 3, so 31 is completely split 
in Ox. 


Some properties of ramification and inertia are given in the following. In the sequel, a 
tower of number fields F C K C LE means that F', K, and L are number fields, with LZ an 
extension of K, and K an extension of F. 
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Theorem 5.1 — Transitivity of Ramification and Inertial Degrees 
Let F C K C L bea tower of number fields, and let 9 be a prime D_,-ideal above the prime 
Ox-ideal P. Then 
er/K(Q)ex/r(P) = ex/r(Q), 
and 
fis (2) fxse(P) = frr(Q). 


Proof. The transitivity of the inertial degrees follows directly from Definition 5.1. To see 
this, let p be the prime OD -ideal below P. Then 


fix (Q)fxjr(P) = |91/2: Ox/P||Dx«/P : OFr/p| = 


\O,/2:ODr/p| = frsr(Q). 
Also, since p C P C Q, then 


ep/r(Q) a en/K(Q)ex/F(P). 


The reader may now recall Theorem 1.30 on page 49, the quadratic case, which we will use 
in the following illustration—see also Remark 1.24 on page 52. 


Example 5.3 Let L = Q(/—I, V10), K = Q(V10), and F = Q. Then by Theorem 1.30, 
we have for p = 5 that 

pO, =o Pips: 
where P; and P2 are prime O,-ideals with ep/xK(P1) = er/K(P2) = 1, and ex;r(pi) = 
€x/F(P2) = 2 where P; ia OK = pj for 7 = 1,2. Thus, 


epjr (Pj) = ersK(P;)exsr (py) = 2. 


Also, if p = 3, then by Theorem 1.30, p is completely split in K and is inert in Q(/—1). 
Therefore, 307 = 9,92, where Q; for 7 = 1,2 are prime Oy-ideals, and fr/%(Q;) = 2 for 
j =1,2, while fx/r(q;) = 1 where 9; 1D % = qj. Hence, for j = 1,2, 


fuse (Q9) = feyw (95) fxs (az) = 2. 


We will now develop tools that will allow us to refine our knowledge of the ramification, 
inertial, and decomposition numbers, especially as we tie them into the theory developed 
in the preceding chapters. First, we extend the notion of trace and norm. 


Definition 5.2 — Relative Norms and Traces of Elements 


Let A/F be an extension of number fields with |K : F| =n, and let 6; for 7 = 1,2,...,n 
be all of the F-isomorphisms of K—see Exercise 2.6 on page 63. Let a € K and set 


Nx (a) = |] 4;(a), 


j=1 


called the relative norm of a in K/F. Also, set 


called the relative trace of ain K/F. Observe that when F' = Q, then these notions coincide 
with those given in Definition 2.4 on page 65, and in this case, we call Nx /g the absolute 
norm and Tijq the absolute trace. 
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Example 5.4 Let K = Q(/—1, V3), and F = Q(V3). Then 
Nxr(5 + V—1) = (5+ V—-1)(5 — V-1) = 26, 


and 

Nxjo(5 + V—-1) = Nixp(5+ V—1) = 267 = 676. 
Also, 

Txp(5 + V—1) = (54+ V—1) + (5- V-1) = 10, 
and 


TK/o(1 + V—1) = 2TK/r(1 + V—1) = 20. 
Example 5.4 motivates the following, which uses the ideas developed in Exercise 2.6. 


Theorem 5.2 — Properties of Relative Norms and Traces 
If F C K C Lis a tower of number fields, then for a € L the following hold. 


(a) Nzyr(@) = Nx/r(Nz/K(@)), and Nz/p(a) € F. 
(b) Tr(a) = Tx/r(Tr/K(@)), and Tr(a) € F. 
(c) If |L: F(a)| =r, then 
Nyjr(@) = (Near (@))", and Ty/p(a@) = 7(Tr(a)/r(@))- 


Proof. (a) Let 0; for 7 =1,2,...,n =|L: K| be all of the K-isomorphisms of L and let W, 
for k = 1,2,...,m=|K : F| be all of the F-isomorphisms of K. Then 


m 


Nxyr(N1/K(a Fuk Wel ‘LU 4;(a)) = [J [] ve @(@)) = Nryr(e), 


k=1j=1 
since the 7,6; are all distinct and ee the F-isomorphisms of L. Observe as well that 
if ~, is the identity embedding of K, then 6;|K = y for all j = 1,2,...,n, and that ~, 
extends to n embeddings of L into C for each k = 1,2,...,m 


(b) The property for the trace is proved in a similar fashion to that of (a), employing 
additivity instead of multiplicativity. 


(c) These formulas are proved in the same fashion as that given in the proof of Theorem 2.5 
on page 66. 


Example 5.5 Let L = Q(V5, /—1), K = Q(V—1), and F=Q. Ifa= V5+ V-1, then 
Nxjr(NzjK(@)) = Nejr((V5 + V—-1)(-V5 + V—1) = Nxjr(—-6) = 36 = 
(V5 + V—1)(—V5 + V-1)(V5 — V=1)(—V5 - V=1) = Nz/r(a). 


Also, 
TK/F(Th/K(@)) = 


Txjr((V5 + V1) + (-V5 + V=1)) = Tyr (2V—1) = 2V-1 - 2V-1=0= 
(V5 + V=1) + (-V5 + V=1) + (V5 — V=1) + (-V5 - V1) = Tr(a). 


If 8@=3+-—1, then 


and 
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The following makes use of Lemma 5.1 on page 182 to introduce a new notion. 


Definition 5.3 — Relative Norms of Ideals 


Let K/F be an extension of number fields, and let P be a prime Dx-ideal above the unique 
prime OD p-ideal p= PN Op. Set 


N¥/F (9) = pik/PO) 


and extend to J € Ia, via 
n 


NK/Fg =I aj fxr (Pj) 


where 


as a product of distinct prime powers in Ox and Pj 1 Or = pj. When F = Q, 
N*/Q(9) = (N(J)), 


the principal ideal in Z generated by N(J) as given in Definition 2.8 on page 83. We call 
N*/° the absolute norm. 


Definition 5.3 tells us, in particular, that N*/"(J) is an D p-ideal for any Dx-ideal J. The 
reader may develop further properties of the relative norm of ideals by solving Exercises 
5.3-5.6. 


Example 5.6 Let K = Q(/—3, V5), F = Q(V5), and p = 11. Then pOx = P1Po, 
where ?; for 7 = 1,2 are distinct prime Ox-ideals, and ex;r(P;) = 1, fsr(P;) = 2 by 
Theorem 1.30 on page 49, and Theorem 5.1 on page 184. Hence, 


NK/F(P,) = ps, where p; = P; M Op. 


Also, 
N*/Q(P,) = (11)?, 


since p; 1 Z = (11). 


What is hidden in the development thus far is the relationship between | : F'| and the 
ramification and inertial degrees. 


Theorem 5.3 — Field Degrees, Ramification, and Inertia 
Let K/F be an extension of number fields. Suppose that p is a prime D-ideal and 
pOrK = Pr ene 


where the P; are distinct prime 0 x-ideals, and g = gx /r(p). Then for e; = ex ;p(P;), and 
5 = fpr (95), 


g 
doeifs = 1K FI 
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Proof. The embedding of Op into Ox induces an embedding of the field D;-/p into the 
ring Ox /pOK. We now show that the dimension of the ring as a vector space over the field 
is indeed |K : F | =n, and that this is also the required sum. 


Claim 5.1 |Dx«/P;’ : Or/p| = e; fj. 
By Exercise 2.40 on page 82, we have 
Dc /P;’ Dx] P;| =e; 
Therefore, 
|Dc/P;’ :Or/p| = |Dc/P5? :OK/P;\|O«/P; : Or/p| = ej fj, 


which establishes Claim 5.1. 
By Claim 5.1 and Exercise 2.40, 


g 


\Dc/pOK« : Or/pOr| =S_ 65 fj. (5.3) 


get 


It remains to show that this dimension is also n. First, we show that it is at most n. We 
do this by demonstrating that any n+ 1 elements of Dx /pD x are linearly dependent over 
Or/pOr. Let a; for j =1,2,...,n+1 be elements of Ox and let @; be the corresponding 
elements of Or/pOpr. Since the a; are linearly dependent over F’, then they are linearly 
dependent over Or by Lemma 1.4 on page 38. Therefore, there exist 6; € Op not all zero 


such that 
n+1 


Ss ja; = 0. (5.4) 
j=l 


Claim 5.2 There existsay € F—Op with y((1,..., 8n41) C Or, but y(S1,.--, Bn4i) Z p- 


By Exercise 1.38 on page 33, there is a non-zero 9 -ideal I such that I((1,...,Gn41) = (@) 
for some a € Op. Thus, [(81,...,;8n41) Z ap, since otherwise a € ap implies 1 € p. Let 
6 €I such that 8(61,...,8n41) Zp. Then by setting y = 8/a, we get the claim. 
By Claim 5.2, reducing (5.4) modulo p yields a nontrivial relation among the a;. In other 
words, not all G;are zero modulo p, so the a; are linearly dependent over Or /pOr. Hence, 
we have shown that 

lIDn/pOr. : Or/pO p| <n. 
We conclude by establishing the full equality. 


Let pM Z = (p), and let pz for k = 1,2,...,gr/o(p) = gi be all of the prime O-ideals 
above p. Now we show that n= ng = |Ox/prOK : Or/px| for each k = 1,2,...,91. 


Claim 5.3 30%), erjq(P;)fro(ps) = |F : Ql. 


We have ‘ 
1, 
N¥/2(p9 p) = [[@)re8? frees) = (p)=i=1 erolPs)fr/a(Ps) 
j=l 


and by Corollary 2.8 on page 85, this equals 


|D-/(p)| = (Ne(p)) = (PFA. 
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Since N¥/2(pO-) = N(p) = (\Or : (p)|) = (Ne(p)) by Definition 5.3 on page 186, this 
establishes Claim 5.3. 
Therefore, since pO r = [[7_, p ey , then using (5.3), 


G1 
N¥/Q (pO x) = [s/o (p29) ))er/olPe) -T NF/Q(p,,)rReF/a(Pe) — 
k=1 
G1 
[] @reeer tree) = (p)=e=1 Mnero(Pr)fr/a(Pr) 
k=1 


by Claim 5.1 and Exercise 5.6 on page 195. However, by the same reasoning as in Claim 


5.3, 
N¥/8(p9 x) os (p)=3=1 ex/a(P3)fajo(Pj) — (p) FeO, 


Therefore, 
G1 
|K: Q\= S Nee r/o(Pe) fr/o(Pr); 
k=1 
so 


nF: Q\= Sarr Pr) fr/o(Pr) nied Pe) frjo(Pr) = 


k=1 k=1 


g 
S > nero (Pr) fF/o(Pr) =|K:Qi=n|F:Q. 


k=1 
Thus, nz = n for each k = 1,...,g9;. In particular, for p, = p, the equality holds. This 
completes the proof. 


In view of Theorem 5.3, we may extend the notions given in Definition 5.1 as follows. 


Definition 5.4 — Inert, Completely Split, and Totally Ramified 
Let K/F be an extension of number fields, and let p be a prime D-ideal with 


g 
pox =|[P7, a EN 


where the P; are distinct, prime Ox-ideals. Then p is said to be completely ramified, or 
totally ramified in O~ whenever 


ej = exF(P;) =|K : F| for some j = 1,2,...,9, 
so fixyr (Pj) =1=9K/r(P;). p is said to split completely, or to be completely split in Ox 
if 
9 = 9xjr(p) = |K: Fl, 


so ex/r(Pj) =1 = fxsr(P;). If fxsr(P;) = |K : F| for 7 =1,2,...,g, then p is said to be 
inert?-? in Ox, so éx/r(P3) =1= gx (Pj). 


5.2T¢ is a common and accepted abuse of language in the literature to say that p ramifies, splits or is inert 
in K, rather than 9,. 
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Example 5.7 Consider the situation given in Example 5.2 on page 183. For p = 29, 
(29) De = (29)Z[W2] = Pi Po, 
for prime D x-ideals P,, and Pz, where ex/g(P;) = 1 for 7 = 1,2, fxsa(P1) = 1, fxjo(P2) = 
2, and g(29) = 2. Thus, 
g 
|Q(v2) :Q| =|K: Fl = ef) =1-141-2=3. 
j=l 


The reader is reminded of the definition of normal extension given in Exercise 2.2 on page 62. 
For such extensions, Theorem 5.3 on page 186 is given as follows. 


Theorem 5.4 — Normal Extensions, Ramification, and Inertia 


Let K/F be a normal extension of number fields, and let p be a prime 0 -ideal with 


where g = gx/r(p) and e; = ex /p(P;). Then 
ex/F( Pj) = exsr(Pe) = ex/r(p), and fr/r(Pj) = fxyr(Pr) = fxsr(P) 
for all j,k € {1,2,...,g}. Thus, 
ex/F(P) fx/F(P)gK/F(p) =n =|K: Fl. 


Proof. The last assertion will follow as an immediate consequence of the initial results via 
Theorem 5.3. 


If we can show that for each P; and P; for any j,k € {1,2,...,g}, there exists an F- 
isomorphism 6 of K such that 6(P;) = P,,°% then the initial assertions follow. To see this, 
suppose that 0(P;) = P,. Then 


g 
pOxK = O(pOK) = Ile 


then e; = ex by uniqueness of factorization of ideals. Also, 
Se =|9x«/Px: Or/p| = |OK/O(P1) : Or/p| = |Ox/P1 : Or/p| = 


Hence, it remains to show that the P; are conjugates over F’. 


Let @ be any F-isomorphism of kK. Since hes = (a) for some a € Ox, then a € Py since 
P, is prime. Since 


Nxjr(a@ = La pe Fis 
where 6; for 7 = 1,2,...,n are all of the ats of K, then 


Nx/r(@Q)Ox © pO, 


5.3When this occurs, we say that the Pj are conjugates over F. The reader may easily verify that 0(?;) is 
a prime 9 -ideal, so 6(P;)NO p=p forcing o(P;)=";,, for some keN—see Exercise 2.40 on page 82. 
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so pOK | Nx/r(@)Ox, which in turn implies that P, | (Nx/r(@)) for all k € {1,2,...,n}. 
Thus, for some ¢ € {1,2,...,n}, Ae(a) € Py. Therefore, 


O¢(P1)"* = O(a)Ox C Pr, 


from which it follows that 6¢(P1) = P;, since both P, and P, are primes. Hence, the P; are 
all conjugates over F’. 


The action of the F-isomorphisms of K on the prime 0x-ideals established in the above 
proof has a name. We also say that the F-isomorphisms of K transitively permute the P,, 
or act transitively on them. Thus, we have the following immediate consequence. 


Corollary 5.1 If K/F is a normal extension of number fields, then the F-isomorphisms of 
K transitively permute the prime 9 x-ideals above a fixed prime ideal p in Dp. 


Example 5.8 Let p* > 2 where p is a rational prime and k € N. Set K = Q(¢,«) and let 
A=1—( «. Then (A) = AOx is a principal Dx-ideal, and is prime since 


NRIPODK) = (P), 


by Corollary 2.8 on page 85 and Exercise 3.35 on page 129. Furthermore, since p = ®,x (1) = 
[t= s .), where the product ranges over all natural numbers j < p* relatively prime to 


p, and by Exercise 3.35, we get p = ur") where u € Ox isa unit. Thus, 
POK = (ADK), 
so since A’/Q is normal, we get 
ex/o(P) = o(p") = |K : Ql, and fx/o(p) = 1 = gx /0(?)- 


We give an interpretation of the relative norm of an ideal that is similar to the relative 
norm of an element. We will employ the Galois theory developed in §2.1. 


Theorem 5.5 — Ideal Norms as Conjugates 


Let K/F be an extension of number fields, and let Z be the minimal normal extension of 
F containing K. Set H = Gal(L/F)/Gal(L/K).°* Then for J € I,,., 


N¥/F (DOr = [J 00902). 
0€H 
In particular, if K/F is a normal extension, then 
NE/FQOx«= [I 0). 


0€Gal(K/F) 


Proof. By Exercise 5.3 on page 194, it suffices to prove the result for J = ?, a prime Ox- 
ideal. Let p= POO-p. First we prove the result for K/F a normal extension, namely 
kK =L. By Theorem 5.4 on the preceding page, 


pOrK = (P,P. aa Pe 


5.4The reader is cautioned that the set H is not, in general, a group. 
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where P = Pi, € = exr(p), f = fx/r(p), and g = 9x/r(p). Since the F-isomorphisms of 
kK transitively permute the P;, for 7 = 1,2,...,g, and since 
efg=n=|K: Fl, 
then for each such 7 there are ef of these isomorphisms that send P, to P;. Therefore, 
g e 
WeM= TL mh =][ ee = eon) = NX" (9) 9x. 
6€H 0€Gal(K/F) j=l 


This completes the proof for the case where K = L. 
In the general case, if 0,07' € H, then 0;(PDz,) = 62(PD_,). Therefore, 


|L:K| 
(TL a2)) = [J P91) = N*/* (P91) 91, 


0€H 0€Gal(L/F) 


by the above case, and by Exercises 5.6-5.7, this equals, 


K/F(\L/K _ yy K/Frp\|L:iKle. — (,7K/F |L:k| 
NOT (NE! (POLO = NO (POM OL = (NO (POL ’ 


and the desired result follows. 


Corollary 5.2 Assuming the hypothesis of Theorem 5.5, let J = (a) € Pa,. Then 
N*/F (9) € Pa,, is the principal fractional 0 p-ideal generated by Nx/p(a). 


Proof. From Theorem 5.5, we get 


N¥/F (9, = [J 0092) = [] 6(e92) = Nix (az. 


OCH 0€H 


Therefore, by Exercise 5.1, 


N*¥/FJ) = NK/F) OL OK = Nxr(@)OL OK = Nejr(@Or, 


which is the required result. 


Example 5.9 Let K = Q(2), which is not normal over Q as observed above. However, 
L = Q(¢3, V2) is normal over Q, where ¢3 is a primitive cube root of unity. In fact, it 
is the minimal normal extension of Q containing kK. The embeddings of L into C are 
{1, 01, 02, 62, 0102, 67.02} where: 


01: V2 GV, and 0:4 G, 


02: 63+ G", and 02: V2 V2. 
As shown in Example 5.7 on page 189, we have the 0 x-ideal 


(29) Dx = Pr Po, 


with fx/o(P1) = 1 = exyjo(P;) for 7 = 1,2, and fx/o(P2) = 2. Also, 29 is inert in 
Q(¢3) = Q(V—3) by Theorem 1.30 on page 49. Therefore, by Theorem 5.1 on page 184, 


(29) 91, = 2199s, 
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where the 9, for 7 = 1,2,3 are O;-ideals with 


fis (Q1) = 2, and fr/%(Q;) = ex; (Qj) = 1, for j = 2,3. 
Also, here 
H = Gal(L/Q)/Gal(L/K) = (01) = {1,01, 03}, 


where 6 is the image of 6, under the natural map that takes Gal(Z/Q) to the set H. Thus, 
again by Theorem 5.1, 


91(Q1) = Qi, 01 (Q2) = Qs, and 0, (Q3) = Qo. 


Hence, 
3 


N¥/2((29) 9K) Or = N¥K/2(P,P2)O = LLC (21993) = 930803. 


Notice that by Definition 5.3 on page 186, 
N¥/2((29) Dx) = N¥/2(P,P2) = N¥/O(P,)N*/O(P,) = 
(29)!x/elP1) (29)Fxra@a) = (29) . (29)? = (29), 
which coincides with the new characterization of relative norms for ideals, since 
N¥/2((29) 9 K)O, = 297 D, = (Q19993)°. 
If we consider the norm from L, then as in the proof of Theorem 5.5, we get, 


N¥/2((29) 97) O71 = II 0(91 993) = 
0€Gal(L/Q) 


(11 H03040)) = (21993)°. 


0c¢H 


Observe that, since L/Q is normal, then e79(29) = 1, frye(29) = 2, and gz/9(29) = 3. 
Again, by our original Definition 5.3 on page 186, we get 


3 3 
N*/2(Q1 993) = |] N*/2(9;) = [J (29)? = (29)°, 


g=1 j=l 
so we achieve, as above, that 


N*/2((29) 9) D1 = 29°, = (Q19293)°. 
Yet another way to see this is to use Exercise 5.6 on page 195 and Definition 5.3 to get, 
3 
N¥/2(Q,9203) = N¥/2(N*/* (Q,0)03)) = N¥/2 | [] N*/*(Q;) 
j=l 
N&K/@ (eae ee eee _ N¥/Q(p2-PPy) _ N¥/Q(p2—2) - 


(29)24x/0(P1) (99)2fxe(P2) — (29)?1(29)?? = (29)°. 


All of the above methods are instructive, but the easiest is to look at Corollary 5.2, from 
which we get that N*/2((29)9x), respectively, N’/2((29)Dz), is the principal Z-ideal 
generated by Nx/g(29) = 29°, respectively, Nz /9(29) = 29°. 
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Corollary 5.2 allows us to achieve yet another characterization for the norm of ideals. 


Theorem 5.6 — Norms of Ideals Generated by Norms of Elements 


Let K/F be an extension of number fields. If J € I,,., then N*/*(J) is the smallest ideal 
of IA, which contains all norms Nx /p(a@) where a ranges over all elements of J. 


Proof. By Corollary 5.2, Nx;p(a) € N*/F (J) for all a € J. It remains to show that the 
Nx/r(@) generate N*/F (J). First, we assume that J is an integral D ,-ideal. 


Claim 5.4 There exist a,3 € J with (a) + (8) =J. 
Suppose that H is an Dx-ideal relatively prime to J = ay Pp; and let 


: a; ajt+l 
aj epi — py. 


Also, by Exercise 1.38 on page 33, there is an O,-ideal J, such that aD xn = JI, for some 
a € Ox. Then by Theorem 1.21 on page 32, there is a solution y = ( to the system of 
congruences 

y=a,; (mod pe) for 7 = 1,2,...,n, 


and y=1 (mod 1H). 
Therefore, 8 € J and we may set BOK = JIg where Ig is an Dx-ideal with 
In + hHI=On Ch+h, 
so I, + Ig =O. Hence, 
ADK + BOxn =I +I =I(h+h) =J, 


which secures the claim. 


By Claim 5.4, N¥/*(I,) and N*/* (Iz) must be relatively prime since J; and Iz are rela- 
tively prime implying that N*/"(I,)Ox and N¥/¥ (I)D x are relatively prime. Also, 


N¥/F (aD) = N¥/F(9)N*/F (11), 


and 
N*IF (80x) = N¥/F(I)NE/P(D). 
Thus, N¥/F (I,) = N¥/F (qI-19 x) and N¥/F (In) = N¥/¥(BI-1D x) are relatively prime. 
Hence, 
N¥/F (aD x) + NE/F (BOK) = N*/P(D)(N*/F (1h) + N¥/P (Ig) = N*/F(9), 


and this completes the proof for the integral case. 


If J is any fractional 9 x-ideal, then J = y~!J for some y € Ox and some integral D x-ideal 
J by Remark 1.13 on page 26. However, 


Nxyjr(y)y' =6 € Ox, 


since + | Nx/r(7). Thus, 
J=6J(5y) 19x = Ho, 


where H = dJOx is an integral Dx-ideal and 


o = by = Nxjr(y) € Or. 
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By Exercise 5.3 and the proof for the integral case, the fractional ideal generated by all of 
the elements Nx /p(@) for a € J is 


o FIN K/F (7) = N¥/F (g-1H) = NK/F(9), 


as required. 


Example 5.10 In Example 5.8, with K = Q(¢,«) and F = Q, we have the principal prime 
OxK-ideal ADK = (A), and 


N¥/FOOx) = (p) = (Nejr())- 


Example 5.11 In Example 5.9, K = Q(¥v2), F=Q, and 
N¥/2((29)9 x) = (29)° = (Nx/(29)). 


In the next section, we will look at another ideal-theoretic concept called the different, which 
will allow us to say much more about prime decomposition in number fields, especially 
cyclotomic and pure extensions such as those illustrations given in our closing examples for 
this section. 


Exercises 
5.1. Let K/F be an extension of number fields, and let J € Ia,,. Prove that 
JOKNF =I. 


Also, show that if J,d € Ip, with IDx = JOx, then] = J. 
(Hint: Use Exercises 3.81-3.32 on page 121.) 


5.2. Prove that the mapping t%/r given in (5.1) is a group monomorphism that induces 
the map given in (5.2). 


(Hint: Use Exercise 5.1.) 

5.3. Let A/F be an extension of number fields, and let J,J € I,,,. Prove that 

5.4. Let K/F be an extension of number fields and assume that Ox% = O-la] for some 
a € Ox. Let p be a prime 9 p-ideal, and let ™.,r(x) be the polynomial determined 


from the minimal polynomial my, r(x) by reducing its coefficients modulo p. Suppose 
further that 


g 
Ma, F(x) = II Ga); ej EN 
j=l 


where the g;(«) are distinct irreducible polynomials over the field Or /p = Dr. Prove 
that 


g 
nx = |], 
j=l 


where the Pj are distinct prime D x-ideals such that fx;p(P;) = degs—(g;). Further- 
more, show that for each 7 = 1,2,...,g, 


Pj = pOrK + gj(A)OxK. 


5.1. 


5.5. 


5.6. 


5.7. 


5.8. 


5.9. 
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(Hint: Use Theorem A.5 on page 328 and Theorem 1.21 on page 82.) 


(This phenomenon does not always occur, as shown by Example 2.18 on page 79. In 
other words, O« need not necessarily be of the form O« = Orla.) 


Let J and J be nonzero ideals in a Dedekind domain R with quotient field F’. Prove 
that if J 4 R, there exists a y € F' such that yJ C R, but yJ Z I. 


(Hint: Use Exercise 1.88 on page 33.) 


Let F C K CL be a tower of number fields. Prove that if J € Ia,, then 


NS) = NE NE * (9): 


Let K/F be an extension of number fields, and let J € Ia,,. Prove that 
NE = 3", 

where n = |K: F]. 

(Hint: Use Exercise 5.3 and Theorem 5.8 on page 186.) 


Let K/F be an extension of number fields. Show that there exists a number field L 
that is a normal extension of F' containing K. 

(Hint: Use Theorem 1.24 on page 39.) 

Let f(a) € Z[a] be nonconstant. Prove that there exist infinitely many rational primes 
p such that f(x) = 0(mod p), for some a € Z. 

(Hint: Use Theorem A.7 on page 330.) 


. Let K/F be an extension of number fields. Prove that there are infinitely many prime 


O p-ideals that are completely split in Ox. 
(Hint: Use Theorem 1.24 on page 39.) 


. Prove that there are no inert primes in Q(¢2») for any n € N with n > 2. 


(Hint: Use Exercise 5.4 and Theorem 1.30 on page 49) 
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5.2 The Different and Discriminant 


Nothing is so strong as gentleness and nothing is so gentle as real strength. 
Ralph W. Sockman (1889-1970) 
Senior pastor of the United Methodist Christ Church in New York City 


In this section we will develop tools that will allow us to generalize the notion of the 
discriminant of a number field, and prove more powerful results than those achieved thus 
far. First, we need the following. 


Definition 5.5 — The Dual/Codifferent 
Let K/F be an extension of number fields, and let J € I,,,. Then 
J* = {8 € K : Tx /p(85) C Or} 


is called the dual or codifferent of J over F, where Tx p(8J) C Or means TK/p(Ga) € OF 
for alla € J. 


Lemma 5.2 — The Dual is Fractional 
If K/F is an extension of number fields with J € I,,., then J* € I,,. 


Proof. Let a ,,a2 € J*, and 81, G2 € OK. Then 
TK/F((8101 + B202)I) C TK) r(G1I) + Tk r( 25) C OF, 


so J* is an Ox-module. Since J € J,,., then by Definition 1.24 on page 26, there exists a 
nonzero 8 € Ox such that 6J C Ox. By Definition 5.5, all such § are in J*, which must 
therefore be nonzero. 


Claim 5.5 There exists a 8* € Ox such that 
BF COr. 


Let 61, B2,...,8n be a basis for K over F with 8; € Ox for 7 = 1,2,...,n, which is allowed 
by Exercise 2.42 on page 82. Let 6 € JN Ox be nonzero, and set 


B* = Nx;r(8) det(TKr(G:6;)). 
Let , 
y= > 058; € 7" (a; € F), 
j=l 


be arbitrarily chosen. Then 

TK) F(Nx/r(8)78i) € OF, 
since Nx (8); € Ox. However, for each i = 1,2,...,n, 

Tk/F(Nx/r(8)¥8i) = Nx/r(2)TK/r(yG:) = Nx/r(8) S- ajTK/F(G:8;). 
j=l 
Hence, for each such 7, 7, 
ajNxK/r(8) det(Tk/r(i5;)) € Or, 

so B*y € Ox. This establishes Claim 5.5, from which it follows that J* € Ia... 
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Lemma 5.3 — Properties of the Dual 


Let K/F be an extension of number fields, and let J € I,,. Then each of the following 
holds. 


(a) JJ* =O%. 
(b) If J is an integral Ox-ideal, then (I*)~! is an integral O x-ideal. 
(c) Ifd € Ia, andIJC J, then J* D g*. 


Proof. Let a € J*. Then Tx /p(aJ) C Or, so TK p(AIOK) C Ox. Therefore, aJ C O7,. In 
other words, a € J~'!D%,. We have shown that 


CAD 
By reversing the argument, we get that 
5 ies © ee i ae 
Hence, we have J* = J~'D%,, so 
aI De = OR 
which is (a). In particular, if J C Ox, then Ox C I*. Therefore, 
ka es OPE al eee 0a es 


which is (b). 
For (c) assume that J C J. Then for any 6 € J*, 


TK/F(89) © Tk/r(8d) C Or. 


Hence, 8 € J*, so J* D g*. 
By Lemma 5.2, if J € I,,, then J* € I,,.. In particular, by part (b) Lemma 5.3, if J is an 
integral OD x-ideal, then (J*)~+ is an integral D x-ideal. In any case, (J*)~+ is a special kind 
of ideal. 


Definition 5.6 — The Different 


Let K/F be an extension of number fields and let J € I,,,. Then the ideal (J*)~! € Ia, 
is called the different of J over F, denoted by Dx r(J). If J = Ox, then Dx p(J) is called 
the different of the extension K/F, denoted by Dx p. 


We now employ the Galois theory developed in §2.1. 

Lemma 5.4 — Properties of the Different 

Let F C K C L be an extension of number fields. Then each of the following holds. 
1. TfJ¢€ Tne) then DFJ) = ID Kir. 
2. Dye = DryjKDxK/F- 


3. If K/F is normal, then for any o € Gal(K/F’), D&) 7 = Dx pr. In other words, Dkr 
is fixed, also called invariant under the action of the Galois group. The notations 
o(Dx pr) and DE /r for the action of o are used interchangeably. 
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4. IfJ ela, and J e€ Ia, then TK/F(d) CJ if and only if J C IDigir- 


Proof. For part 1, we use part (a) of Lemma 5.3 on the previous page to get, (JJ*)~! = 
(O%)~*, so 


(Kt = (TY OR) = IDK), 
namely Dxjr(J) = ID K/F- 
For part 2, we observe that a € Dik if and only if Ty;~K(a@) € Ox by Definition 5.5 on 
page 196. In turn the latter is equivalent to 


DeypliK(a q) c Dir (5.5) 
by part (c) of Lemma 5.3. Also, (5.5) holds if and only if 


by part (b) of Theorem 5.2 on page 185. Lastly, (5.6) is equivalent to oa that a 


Di je: We have shown that a € Dee 


DxjrDrjp- In other words, DyysF = = Dy/KDK/F- 
For part 3, let 6 € 0%, and o € Gal(K/F). Then since Tx/;p(G9xK) C Or, we have 


if and only if a € DejrDz; namely Dr, = 


L/F? 


Tx/r (87D) = Tj r (89% ) =Txjr(BOx) C Or. 


Therefore, O%-7 C O% for all o € Gal(K/F). Similarly, 0%" | C O%, so O% C OL’. 
Hence, OF, = “on? ; sandals Dip = = Dr. 
Finally, for part 4, Pjr() C Jif and only if I>’ Tr (J) = Txjp(0"9) C Op, which in 


turn holds if and only if J-'g C D’,,,, namely when J C ID; 


K/P? K/F- 


We now are able to generalize the notion given in Definition 2.7 on page 77. 


Definition 5.7 — Discriminant of a Relative Extension 


Let K/F be an extension of number fields. Then the discriminant of K/F is N¥/* (Dx p), 
denoted byA x p. In particular,A xg = ( Ax) is called the absolute discriminant of K. 


The reader should now go to Exercise 5.17 on page 212 for an explicit example of the above. 
An important property of relative discriminants is given as follows. 


Lemma 5.5 — Relative Discriminants in Towers 
If F CK C Lis a tower of number fields, thenA 7/7 = Ap NE/F (Ay x). 


Proof. From part 2 of Lemma 5.4, we have 
Arp =N*/¥ (Dix Dejr) = NYP (DrjK)N*/* (Dejr), 


where the last equality comes from Exercise 5.3 on page 194. By Exercises 5.6—-5.7 the latter 
equals, 


N¥E/FONEIE (Di ic) NE/F (NEI (Diep) = NE/F (Anjx)N*/F DK p) 


= NEP (Ag )N*P (Dacjp El = NEP (An )Aie es 
/ 


as required. 


The next result verifies that the absolute discriminant coincides with the notion given in 
Definition 2.7 as an ideal generator. 
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Theorem 5.7 — Dual Basis, Different, and Discriminant 
Let F be an algebraic number field, and let J € I,,, with Z-basis B = {a1,a2,...,Q@,} and 
set B* = {aj,a5,...,a7,} with a} € F defined by 
Tr/Q(aid}) = On; 
where 6;,; = 0 if i # j and 6;,; =1ifi=j.°° Then the set B* is an integral basis for J*, 
called a dual basis. Furthermore, 
N¥/9(Dp/Q(9)) = N*/2(9)|ArI. 
In particular, 
N¥/9(Dpjq) = |All: 
In other words, as ideals, 


Arg = ( 4). 


Proof. Let A be the matrix with entries (T'p/g(aia;)). From Theorem 2.8 on page 73, we 
know that det(A) 4 0. Thus, A is invertible, so AA~! = I,,. The diagonal of this identity 
matrix consists of elements a;a; = 1, where the 


from Theorem A.22 on page 338. Also, the off-diagonal elements of the identity matrix 
give us that ajaj = 0, with the aj € F similarly determined by Theorem A.22. Hence, 
Trjg(aja;) = 1 and Tr /g(aiaj) = 0 for i # j. This establishes the existence of the 
elements in B*, and so secures the validity of the first assertion. 


Let 8 € F. Then by the definition of the aj, there exist q; € Q such that 8 = jel qj Qj 
Also, for any a € J, there exist z; € Z such that a = S>""_, z;a;. Thus, 


Tr/a( a8) 25 Sr au Qj 
4=1.g=1 


so Tr/g(aB) C Z exactly when q; € Z for 7 = 1,2,...,n, so B* is an integral basis for J*. 


For the assertion on norms, we first assume that J = Op. By the above, DO} = Dro has 
dual basis consisting of the aj. Let m € N such that 
ma; =m; € Or, (5.7) 


which is allowed by Lemma 1.4 on page 38. Let J = mDrjq C Op. Then by Corollary 2.8 
on page 85, 


NFIO(De ig)” = NFO(Im—*)? = NFIS9)?Nejq(m)* = NF/2(9)?m—™, 
and by Theorem 2.12 on page 85, this equals 
disc ({m},m3,...,m*}) Ap ~'m-2” = disc ({at,a%,...,a%}) Ar * 


where the last equality follows from (5.7). 


5-5The 5;.; is called the Kronecker delta. 


tg 
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To complete the proof, we observe that the following matrix equation holds, 


(Tryq(aia})) = (4; (a4))(9;(07))", 


where 61, 02,...,9, are the Q-isomorphisms of F’. Hence by the Kronecker delta symbol, 
this is the identity matrix, so 


disc(ai,...,a%) = dise™'(a1,...,0n) = Art. 
We have shown that N*¥/°(Dp/g)? = Ar’, so 
NF/°(Dpjq) = |Arl- 
By part 1 of Lemma 5.4 on page 197, if J € Ia,, then 


N¥/2(D p/Q(J)) = N¥/2(ID pq) = NF/2(J)NF/2(Dpyjg) = N*/2(9)|Arl, 


as required. 


Corollary 5.3 Suppose that L is a number field with squarefree discriminant A,. If QC 
Kk CL is a tower of number fields, then K =Q or kK =L. 


Proof. By Remark 3.14 on page 116, if K 4 Q, there is a prime p | Ax. By Lemma 5.5 on 
page 198, and Theorem 5.7 on the previous page, 


(p)|"*! | Arig =(4), 


contradicting the squarefreeness, so L = K. 


Corollary 5.4 IfQCK CL is a tower of number fields, then 
Agi? ®| | Ne. 


Proof. By Lemma 5.5, 
LK 
Axjl"' | Azjo, 


which secures the result. 


The reader will observe that Corollary 5.4 generalizes Kronecker’s result given in Theo- 
rem 3.15 on page 126. 

The following result, which was known to Euler in a different form, is another tool in our 
quest to establish a fundamental result in the theory of the different that, in turn, will allow 
us to establish important results in ramification theory in §5.3. 


Theorem 5.8 — Generators for the Dual of a Primitive Extension 

Let K/F be an extension of number fields, with K = F(a) where a € Ox, and set 
|: F| =n. Then 

Orla] 


Ma, p()’ 


Orlal* = 


where m/, » is the formal derivative of my,r7. In other words, O/[a]* is generated as an 
Opr-module by the elements 


a! /m, (a) for j = 0,1,2,...,n—1. 
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Also, 
qi 
Tir | ——7~ | = 9 for all j = 0,1, 2,...,n — 2, (5.8) 
my, p(@) 
and 
q?l 
Tx/r | ——~ | = 1. (5.9) 
: (5) 
Proof. Let a;, for 7 = 1,2,...,n, be the conjugates of a over F’, where a; = a—see Ex- 


ercise 2.1 on page 62. By applying the Lagrange interpolation formula—see Theorem A.26 
on page 342—we get 


is Mo,F (x) “Mea F(2) “ak 
1= : = , = (5.10) 
d. ™M4,,,F(@)(x — ai) 2, m4, ,P(@) d gest 


where the last equality comes from Theorem B.4 on page 347. Also, if 
Me,F(@) = 2" + iS apx”—*, 
k=1 


then 


1 1 ap 
a 5.11 
Ma,F(£) gn oS gntk ( ) 


k=1 


By comparing (5.10)—(5.11), we get (5.8)—(5.9), which also says that 


Ql 
T ——— ] € Or, 
ne (=) 


sO 


In other words, 


It remains to establish the reverse inclusion. Let y € Or[a]*. 


Since the elements ad /m), (a) for 7 = 0,1,2,...,n.—1 form a basis for K over F', we may 
write 
n-1 ays 
y=) 45 
y 7m, p(@) 
Therefore, 


n-1 H 
a 
Tk/F(y) = S- ajlk/F (=a) = Gn-1, 
j=0 


a,F 
by (5.8)—(5.9) established above. Since y € Or|[a]*, then a,_1 € Or. Now let 


n-1 


Ma,F(£) =ar+ S- byx*, with by € Dp, (5.12) 
k=0 
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since a € Ox. Thus, 


Tx/r (ya) s L ae ip a 
K/F(yYQ) = ajlLK/F | a | OF On-2 + On-1L k/F | = 
/ = J / / m! (a) 


mi, @ (a) a,F 


and from (5.12), this equals 
An—2 — An-1 beT Kr | ——a = An—2 — An—16). 
ie \ mi, p(@) 


Since Gn_2 — Gn—1b, € Op, then an_2 € Or. Continuing in this fashion, we see that all 
a; € Or, soy € Orla], which completes the reverse inclusion, and hence the entire proof. 


Now we turn to a concept that will help to explain the term different. 


Definition 5.8 — Different of an Element 
Let K/F be an extension of number fields with K = F(a) fora € Ox. The different of a 
is m/, p(a), denoted by 

Ox/F(Q). 


The reason for the name “different” in Definition 5.8 is that m/, -(a) 4 0 exactly when 
a is different from all of its conjugates over F. In other words, a 4 6;(a) for all F- 
isomorphisms 6; of K that are not the identity embedding, namely when a is a primitive 
element over F'. Now it is important to compare D% with Orla]*. We know that O% C 
Orlal* = Or[al/m), -(q@), since for any 8 € DF,, we must have Tx /p(BOr[a]) C Or, given 
that Orla] C OK for a € Ox. Now we look at the reverse inclusion from the following 
perspective. 


Definition 5.9 — The Conductor 


Let K/F be an extension of number fields, and let R be a subring of Ox such that Or C R. 
Let fr >°® be the greatest common divisor of the D.-ideals contained in R. We call fz the 
conductor of R in OK. 


Lemma 5.6 — Conductor Characterization 


Let K/F be an extension of number fields, and let R be a subring of Ox such that Or C R. 
Then 
fr={BeEK: BR* COX}, 


and fr is an 0 ,-ideal contained in R. In particular, if R = Or[a] for some a € Ox, then 
fo = My P(A) = OK /F(Q)D% 
is the conductor of Or[a] in Dx and is the largest D,-ideal contained in Dla]. 


Proof. Set 
IT={BEK:BR* COR}. 


For 6 € I, y € Ox, we have 
YBR* C YOR COR. 


5.6The letter f is used here since the origin is in the German language, where the term for conductor is 
Fuhrer. 
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Therefore, I is an O%-module. (Observe that if R # Ox, then R* is an O--module, but 
not an Ox-module.) Also, R* C O%, so 

TkjrUIR") © Tx r(OK) C Or. 


Therefore, R* C I*, so 1 C RC Ox. This shows that J is an D,-ideal contained in R. 
Consequently, the DO x-ideal fr divides J. 


Suppose that J is an O,-ideal in R and 6 € R*, namely TK;p(GR) C Or. Then 
Tx;r(BJDK) © Tx;r(BR) C Or. 


Therefore, BJ C 0%,, which implies that J C J. Thus, J divides all O,-ideals in R, so 
I | fr. Hence, 
l=frCR, 


as required. 
Now if R = O(a}, then by the above and the fact that D7 C Orlal*, we get, 


Ma, r(MDK CM, p(@)O lal” = Orla] C Ox, 


where the equality comes from Theorem 5.8 on page 200. Thus, m/, -(@)O% is an Ox-ideal 
contained in fy. Also, from the proof of Theorem 5.8, 


Orla] 
T ——~ ] € Or. 
ie (ee “ 
Therefore, Or[a]/mi, p(a@) C OX, but fa C Orla] by the first part of the proof, so 


fa/Ma,7(@) S Dig. 


In other words, fa C mj) p(a)O%. Hence, 


fa = My, F(A)D%K, 
which is the first required equality. For the second one, we first note that since Ox% C 
Opla}*, and Orla] C OX, then 
{8 € Orla]: BOx C Oerlal} C fa. 
Conversely, from the first proved equality, and the fact that 
Mo, F(a) Die © mg, r(@)Or[a}* = Orla}, 


from Theorem 5.8, then for any 6 € fa = mj, p(Q)O% we get that 6 € Orla] and BOK C 
Orla}. Hence, fa C {68 € Orla] : BOK C Orlal}, so we have the full equality. 
Lastly, f. is the largest 0,-ideal contained in Orla] by the above and Remark 1.15 on 


page 31. 


The reader is now encouraged to solve Exercise 5.18 as an explicit example of the above. 
The following links our previous notion of different to the above. 


Theorem 5.9 — Generation by Differents of Elements 


Let K/F be an extension of number fields. Then Dx/p is the Ox-ideal generated by the 
6% /F(@), where a runs over the elements of Dx. 
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Proof. By Lemma 5.6 on page 202, 6«/p(a) € Der for alla € Ox. In fact, from that 
lemma we have: 

OK/F(Q)OK = faD jr. 
Now we show that it suffices to find an a € Ox such that P { f, for all prime D,-ideals P. 
If such an a exists, then the ideal 7 ,c9,. fa generated by Useo x. fa must equal Ox. Hence, 


Dejrp = Dx/rOK = Dx/r ( S- ‘ = yi DxjPoK/F(Q)OK = 


acoK acOK 
Ss Ox/F(Q)OK (Die) =>) Ox/F(Q)Ox, 
aeDK acOK 


which shows that Dx /p is generated by the dx/r(a). Hence, it remains to show that such 
an @ exists. 


Let P be a prime x-ideal, and let p be the prime 0 -ideal lying below it. Furthermore, 
suppose that pO x = P°J, where e CN and Pt J. 


Claim 5.6 There exists an a € J, with a ¢ P, such that its residue class, @, modulo 
P is a generator of the multiplicative group of nonzero elements of the field 0% /P—see 
Exercise 4.25 on page 163. 


Let 6 € Ox with 6 ¢ P. Then # is a generator of the multiplicative group of nonzero 
elements of the field Ox /P. By Exercise 4.31 on page 164, 


BN) = B (mod 9). 
If BN) = 6 (mod P?), then let y € ?, with y ¢ P?. Then 
B+7=8 (mod P) and (6+ 7)" =pN™ (mod P?), 
by the Binomial Theorem—see Corollary A.11 on page 341. Therefore, 
(B+7)N = B+ (mod P) and (6+7)% #8+7 (mod ¥). 


Since J and ? are relatively prime, then O% = P? +9, so 6+ y = 8, + a where f, € P? 
and a € J. Thus, 


a=6+7-6,=8 (mod 9), 


so @ is a generator of the multiplicative group of nonzero elements of the field Ox /P, and 
a ¢P. This completes Claim 5.6. 


Claim 5.7 Let n € N and suppose that S is a system of n representatives of 9% modulo 
P, with 1 € S and let w € P, w g P?. Then 


n-1 
N° ajw) : a; € § for j =0,1,...,.n-1 ; 


j=0 


is a system of representatives of Ox% modulo ?”. 
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We use induction on n. If n = 1, then S = JT, so we have the induction step. Assume, for 
the induction hypothesis that the result holds for n — 1. Let t),t2 € J with t; = 6; + ws; 
for j = 1,2. if ty = to E Pe, then (by bz) pr ($1 82)W C P. Thus, by = bg and 
(s1 — 82)w € ®”. Therefore, there exists an DO ,-ideal J such that 


OK (s1 — 82)w = P'. 
Also, since P? { (w) =wDx, there is an Ox-ideal I, not divisible by P such that 
Onw = PI. 


Thus, 

Py = OK (s1 came 82)W = OK (s1 _ 82) PI, 
so 

pret | OK (s1 = $2). 

Therefore, s1 — s2 € P"~1. By induction hypothesis, s1 = sz, so ty = tz. We have 
shown that for any ¢1,t2 € J, with t; — tg € P”, we get t; = to. Hence, J has N(P)” 
different representatives of Ox modulo ®”. By Exercise 4.25 on page 163, J is a system of 
representatives of O% modulo P. This is Claim 5.7. 


Claim 5.8 For anyn €N and any 6B € Ox, there exists a unique y € Orla] such that 


B=7 (mod Pf"). 


Let w = aN) — qa. Observe that by the same argument as used above on 8 +y we get that 


w ¢ P?. Thus, by Claim 5.7, for any 8 € Ox, there exists a unique ae ; ) ajww4 € Orla] 
with a; € S such that 6 —y € P”. This is Claim 5.8. 


Finally, we now show that P { fa. 


Let BE Oxm, p(a) OF. Then Op = pd, where a is a nonzero integer and the prime 
p below P does not divide the D-ideal J. Consider 


BP Op= (o1) and peer — (a2). 


Therefore, 71 = 203 where 03 € J"®. Also, 03 ¢ p since p?”’" +1 + (a1). We now demon- 
strate that Ox(c30%"F) C Orla]. By Claim 5.8, for any given p € Ox, there exists a 
7 € Orla] such that p— y € P°*"F. Since 


ahr _ ( ahr 


po3a p—y)a3a"F + yo30 


and yo30°"F € Opal, then it suffices to show that (p — y)o3a%”"" € Orla]. Since (a2) = 
p?’r, then 


OK (p— y)o203D Kate — Ona, PFO par 
hp — VK\P — ¥)/9293VK KO K 
Ox(p— oso"? = west Ce 


COKB C Delo, 


where the penultimate inclusion comes from the fact that a ©€ J, and the final inclusion 
arises from the fact that mj, p(a@)9«K C Orla] by Exercise 5.12 on page 211. Having shown 
that the ideal Ox(o30%"") C Opa], then it follows from Lemma 5.6 on page 202 that 
OK (o30%F) C f,. Since o3 ¢ p, then 3 ¢ P. Thus, since a ¢ P, we get by primality that 
o3a%"r ¢ P. Hence, fa Z P, namely P+ f., which is the entire result. 


From the proof of Theorem 5.9 emerge two immediate consequences. 
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Corollary 5.5 If K/F is an extension of number fields, then for all prime Ox-ideals P 
there exists an a € Ox such that Pf fa. 


Corollary 5.6 — Dedekind, 1881 


Let K/F be an extension of number fields, and let P be a prime Dx-ideal. Suppose that J 
is an Ox-ideal not divisible by ?. Then there exists an a € J such that for all @ € Ox and 
any n €N, there exists an element y € Orla] with 


B=7 (mod Pf"). 


Our next goal is to establish what may be considered as the main result in the theory of 
the different, namely the link between the different and ramification. We will prove that 
the primes that ramify in an extension K/F of number fields are precisely those primes 
that divide the different, and therefore that there are only finitely many of them. There 
are many methods in the literature for achieving such a task. One of them involves an idea 
put forth by Weil in 1943—see Biography 5.1 on page 211. He observed that the different 
is intimately linked to the notion of abstract differentiation in commutative rings. 


Definition 5.10 — Derivations 


Let R be a commutative ring with identity and let M be an R-module. A homomorphism 
0 from R into M is called a derivation of R on M provided that, for all a,6 € R, 


d(aB) = ad(B) + B2(a). (5.13) 
If T is a subring of R such that a derivation 0 of R on M satisfies 
d(a) = 0 for alla ET, 


then 0 is called a derivation of R on M that is trivial on T. In the case where M is 
a commutative ring, a derivation 0 is deemed to be essential if there exists an element 
+ € 0(R) such that y is not a zero divisor. 


Remark 5.2 Observe that since 0 is a homomorphism of additive abelian groups, then in 
addition to (5.13), we have that 


(a + B) = (a) + 0(), 


for all a,8 € R. Also, note that (5.13) is the analogue of the standard product formula for 
derivatives in elementary calculus. 


The reader may now solve Exercises 5.14—5.16 on page 212. 


Theorem 5.10 — Differents and Derivations 


Let K/F be an extension of number fields. Then Dx p is the least common multiple of all 
Ox-ideals J for which there exists an essential derivation 


dV: OK KH OK/I 


that is trivial on Dp. 
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Proof. We first show that it suffices to prove the result for J = P” where n € N and P is 
a prime Dx-ideal. Let J = Mess vs where the P; are prime 0 -ideals, and a; € N for 
j =1,2,...,n. Suppose that 

I: OKH OK/I 


is an essential derivation of Ox into Dx / Pe , which is trivial on Of. Then 
0; ; OK > Dn P; 
defined for each 6 € Dx by . 
0;(8) =0(B) (mod %) 


is also an essential derivation of Dx into Ox ee that is trivial on Or. Conversely, if 
0; is an essential derivation of Ox into Ox / PF? that is trivial on O;, then the n-tuple 
d = (01,...,0,) acting on 


OK/IS [[9«/9%: 
j=l 


via Theorem 1.21 on page 32, the Chinese Remainder Theorem, induces a derivation 0’ of 
Ox into Ox /I that is trivial on Or. It remains to show that 0’ is essential. Suppose that 
the n-tuple (G1,...,8n) € Ox is such that 0;(;) is not a zero divisor in Dx«/P;’. By 
Theorem 1.21, again, we may choose 6 € Ox such that 


8 = B; (mod P%") 


for each 7 = 1,2,...,n. Therefore, 0’(3) is not a zero divisor in D,%/J. 


Our remaining task is to prove that an essential derivation of Ox into Ox /P” exists if and 
only if P” | Dx r- If 0 is such a derivation, then by Corollaries 5.5-5.6, we may select an 
a € Ox such that P{f., and for any n € N and any 6 € Ox we have 


8 = g(a) (mod P**) 
for some g(a) € Orla]. For such a congruence, we get from Exercise 5.15 that 


0(8) = 0(g(a)) = g'(a)o(a), 


where the last equality follows from the very definition of a derivation, with g’ being the 
derivative of g. If 0(a) is a zero divisor, then 0(G) is a zero divisor for all 8 € Ox, 
contradicting the choice of 0. Thus, 0(q@) is not a zero divisor, so 


0 = 0(0) = 01M, r(a)) = m4, p(@)d(a). 
Therefore, 
mi, p(a) =0 (mod P"). 
By Lemma 5.6 on page 202, 
My, F(A)Di¢ = faD ej PF, 
but Pt fas so ‘p” | Dir. 


Conversely, assume that P” | Dkr, and select a € Ox such that P{ fa. Let 8 € fa with 
B€¢P. By Lemma 5.6 again, every y € Ox may be written as 


(a) 


Yre > 
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where g(x) € Op|az]. Since 8 € fy C Orla], by Lemma 5.6 one more time, then 3 = h(a) € 
Orla]. Since 8 ¢ P, then 8 has a multiplicative inverse 0 € Ox modulo P, namely 


Bo =1 (mod P"). (5.14) 
Define for each y = g(a)/B € Ox, 
0(7) = (g'(a)h(a) — gla)h'(a))o? (mod P"). (5.15) 
Claim 5.9 0 is an essential derivation of Ox into 0% /P” which is trivial on Op. 
If y = g;(a)/6 for j = 1,2 are two expressions for y € Ox, then 
g(a) — g2(a) = 0, 
so there exists a k(x) € Or[a] such that 
g(@) — 92(@) = Ma,r(x)k(2). 
Therefore, 
G1(@) — h(a) = my w(a)k'(a) = 0 (mod P"), 
where the congruence comes from Theorem 5.9 on page 203, since 
pr | Di/F | mi, r(Q)OK. 
This shows that (5.15) is well-defined. 


If we consider the product 


(Gy at _ gi(@) go(@) 
Page ene PERE B 3 


then 8; = ogi(q@) and 82 = ogo 
810(B2) + 822(B1) = o91(@) [9(a)h(@) — g2(a)h'(a)] 0? 

+092(a) [91(a)h(a) — gi(a)h'(@)] 0? 

= 0° ([91(@)92(a) + 95 (@)g2(a)] h(a) — 291(@)g2(a)h'(a)) (mod P"). (5.16) 


a). Therefore, 


Since 
g1(@)g2(a) = Bg(a) = g(a)h(a), 
then for some (x) € Dp|{s, 
g1(@)g2(x) = g(@)h(a) + mar (a)e(a). 

By differentiating the latter, evaluating at « = a, and looking at it modulo P”, we achieve, 

g1(@)9o(@) + 91 (@)g2(a@) = g'(a)h(a) + g(a)h'(a) (mod P”). (5.17) 
Hence, by comparing (5.16)—(5.17) and using (5.14), we get 

819(82) + B20(81) = 07 (g'(a)h(a) — g(a)h'(a)) (mod P"), 
so 

B10(B2) + 820(B1) = 0((81 8). 

Thus, 0 is a derivation, and it clearly is trivial on Or. Since 0(a) is the identity of Ox /P”, 
then 0 is essential. 


This completes the proof.>:” 


We are now in a position to establish the following main result. 


5.7Observe that (5.15) is the analogue of the quotient rule in elementary calculus. 
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Theorem 5.11 — Fundamental Theorem of the Different 


Let K/F be an extension of number fields, P a prime D,-ideal with p the prime 9 p-ideal 
below it, and set e = ex /p(P). Then 


PE™ | Dien: 
Furthermore, if gcd(e, N¥/2(p)) = 1, then 
P°t Deir. 


Proof. Let pO x = P°I, where p= PNOp, P{ I, and e = ex;p(P). If 8 € PI, there must 
exist an n € N sufficiently large such that 


BP” EPP IC POR, 


where p = pM Z. By Exercise 5.8 on page 195, there exists a normal extension LD of F 
containing AK. Thus, if 0; for 7 = 1,2,...,d = |L: F| are all of the F-isomorphisms of L 
into C, then for each such j, 0;(3”") € Or. Hence, 


Tr/F(B? ) EPOLAOPF=p. 
Thus, by the Binomial Theorem, 


Tr jr(B”) —Trx(B)? €p. 


Hence, T,/« (8) € p, so 
TrjK(PI) Cp. 


Therefore, by part 4 of Lemma 5.4 on page 197, 


PEC HD re 


In other words, 
PID K/F Cpc PORK = Pe7. 


Hence, Dx /p C P°', namely 
Pe | Dippy 


as required for the first statement. 


Now we establish the second statement. By Theorem 5.10, it suffices to prove that every 
derivation 0 of Ox into Ox /P*, which is trivial on Op, satisfies that 0() is a zero divisor 
for all 6 € Ox such that 0(3) 40. 


We break this into three cases. 
Case 5.1 GE P—F? 
Let a € p—p?. Then there exist y,o € Ox —P such that a = 8°y/c, so 
oa = Bey. 
Therefore, since a € O-, and 0(G°) = 0, then 
0 = 08°) = B°(y) + 70(8°) = 70(8°) = yeH*10(8), 


where the last equality is from Exercise 5.15. Since p{e, y ¢ P, and 6°! ¢ P°, then 0(f) 
is a zero divisor. This completes case 5.1. 
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Case 5.2 BE?" forn > 2. 


We may assume that n < e, since otherwise 0(3) = 0. Also, we may assume that 6 ¢ P"—!. 
Thus, there isa p € P—#? and y,0 € Ox —P such that 8 = p"y/o. Therefore, by Exercise 
5.15, 
0(0B) = O(p"7) = 72(p") + p"d(7) = ynp™*(p) + p"d(), 
and the right-hand side is an element of ?”/P°, so the left-hand side is also such an element. 
However, 
(08) = 00(B) + B2(o), 
and 80(c) € P"/P£, so od(B) € P”/P°. Consequently, (cd(3))" € P*, so since o ¢ P, 
then 0(3) is a zero divisor in 0% /P*. This is Case 5.2. 
Case 5.3 6 ¢P. 
By Exercise 4.31 on page 164, 
BNx0(?)-1 =1 (mod P), 

so there exists an a € P such that 

BNxeP)-l —1 4a, 
By Exercise 5.15, 

o(BNK/AO)1) = (Nxja(P) — HANKRO(B), 


and we also have that 

o(BNa/e)-1) = a1 +a) = 2(a), 
where 0(a) € P is a zero divisor, since Ba = B(BN«/0()-1) = BNx/o) — BE P. Given 
that (Nx/o(P) — 1) BNx/0)-? ¢ P, then 0(@) is a zero divisor. This completes Case 5.3, 
and so the entire result. 


The following consequences of Theorem 5.11 are the promised links between the different 
and ramification. 


Corollary 5.7 If P is a prime D,-ideal, then P ramifies in K/F if and only if P | Dx/F- 
Consequently, there are only finitely many ramified primes in K/F. 


Proof. The first assertion is immediate from Theorem 5.11. That there are only finitely 
many follows from the first assertion via Exercise 2.52 on page 86. 


Corollary 5.8 A prime 0--ideal p ramifies in K if and only if p | Axk/F- 


Proof. Tf 

p | Ax/r= N¥/F Dip), 
then P|p for some prime 0 x-ideal dividing Dx,/p. By Corollary 5.7, P must ramify in K/F, 
whence, p ramifies in kK. Conversely, if p ramifies in K, there exists a prime Ox ideal P 
above p which ramifies in K’/F’. By Corollary 5.7, P | Djp, 80 


p | pfx/r(?) ns NK/F (9) | N*/F (Dicip) -_ A/F; 


which follows from Exercise 2.46 on page 86. 


The interpretation of Theorem 5.11 on the preceding page will be expanded in §5.3 when 
we introduce ramified and unramified field extensions. 
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Biography 5.1 André Weil, pronounced vay, (1906-1998) was born on May 
6, 1906 in Paris, France. As he said in his autobiography, The Apprenticeship 
of a Mathematician, he was passionately addicted to mathematics by the age 
of ten. He was also interested in languages, as evidenced by his having read 
the Bhagavad Gita in its original Sanskrit at the age of sixteen. After graduat- 
ing from the Ecole Normal in Paris, he eventually made his way to Gottingen, 
where he studied under Hadamard. His doctoral thesis contained a proof of the 
Mordell-Weil Theorem, namely that the group of rational points on an elliptic 
curve over Q is a finitely generated abelian group. His first position was at Ali- 
garh Muslim University, India (1930-1932), then the University of Strasbourg, 
France (1933-1940), where he became involved with the controversial Bourbaki 
project, which attempted to give a unified description of mathematics. The 
name Nicholas Bourbaki was that of a citizen of the imaginary state of Pol- 
davia, which arose from a spoof lecture given in 1923. Weil tried to avoid the 
draft, which earned him six months in prison. It was during this imprisonment 
that he created the Riemann hypothesis—see Hypothesis B.1 on page 354. In 
order to be released from prison, he agreed to join the French army. Then he 
came to the United States to teach at Haverford College in Pennsylvania. He 
also held positions at Sao Paulo University, Brazil (1945-1947), the University 
of Chicago (1947-1958), and thereafter at the Institute for Advanced Study at 
Princeton. In 1947 at Chicago, he began a study, which eventually led him to a 
proof of the Riemann hypothesis for algebraic curves. He went on to formulate 
a series of conjectures that won him the Kyoto prize in 1994 from the Inamori 
Foundation of Kyoto, Japan. His conjectures provided the principles for mod- 
ern algebraic geometry. His honours include an honorary membership in the 
London Mathematical Society in 1959, and election as a Fellow of the Royal 
Society of London in 1966. However, in his own official biography he lists his 
only honour as Member, Poldevian Academy of Science and Letters. He is also 
known for having said In the future, as in the past, the great ideas must be the 
simplifying ideas, as well as God exists since mathematics is consistent, and the 
devil exists since we cannot prove it. This is evidence of his being known for 
his poignant phrasing and whimsical individuality, as well as for the depth of 
his intellect. He died on August 6, 1998 in Princeton, and is survived by two 
daughters, and three grandchildren. His wife Eveline died in 1986. 


Exercises 


5.12. Let K/F be an extension of number fields, and let a € Ox such that K = F(a). 
Prove that mi, p(@)Ox« CS Or{al. 


5.13. Let K/F be an extension of number fields, and let a € Ox such that K = F(a). Prove 
that (0%)~' = Om), p(a) if and only if Ox = Orla]. (Hint: Use the Lagrange 
Interpolation Formula in Appendix A.) 

5.14. Let K/F be an extension of fields. Show that the set of all derivations of F in K 
form a vector space over K. For two given such derivations 0;, 02, define the bracket 


operation, 
[01,02] = 020; — 0402. 


Show that the bracket operation is a derivation of F into K. Furthermore, for any 
three such derivations 0; for 7 = 1,2,3, establish the Jacobi identity: 


[[21, 02], 03] + [[O2, 03], 01] + [[03, 01], 02] = 0. 
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5.15. 


5.16. 


5.17. 
5.18. 
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(Hint: In the process of verification, establish and use the fact that the bracket opera- 
tion is anticommutative, namely that [01,02] = —[02,01].) (The resulting vector space 
with the bracket operation forms a nonassociative algebra, called a Lie Algebra, over 


K, and the bracket operation is called a Lie Product, or commutator product.) 


Let S be a commutative ring with identity and R a subring, and 0 a derivation of R 


into S. Prove that for alln € N andae R, 


a(a”") = na"—'d(a). 


Let K/F be an extension of number fields, and let P be a prime Dx-ideal. Suppose 


that for a given n EN, 


Do 
W:0KH Ze 


is a derivation of Ox into Ox /P”". Prove that 0(a) = 0 for all ae Prt. 
Let F = Q(V10), and J = (2, V10). Find I*, Dpjg(J), Dryg, I*71 andA pyq. 


With reference to Exercise 5.17, find f 7. 


Biography 5.2 Marius Sophus Lie (1842-1899) was born on December 17, 
1842 in Nordfjordeid, Norway. Ludwig Sylow (1832-1918) was one of Lie’s 
teachers at the University of Christiana (which became Oslo in 1925), from 
which he graduated in 1865. In 1869, Lie went to Berlin where he met Felix 
Klein (1849-1925). This began a collaborative effort that resulted in several 
joint publications. Among the consequences of this work is Klein’s characteriza- 
tion of geometry involving properties invariant under group actions, which was 
established in 1872. Asa result of the Franco-German war of 1870, both Lie and 
Klein left France. Lie planned to go to Italy, but was arrested as a German spy, 
with the unfortunate assumption being made that his mathematical notes were 
coded messages. Only after the intervention of Gaston Darboux (1842-1917), 
a leading French geometer at the time, did Lie get released. Lie then returned 
to Christiana, and obtained his doctorate there. He began an investigation 
of differential equations in an attempt to find an analogue of Galois theory. 
Ultimately, he was led to a structure that we now call a Lie algebra. He aban- 
doned the study of partial differential equations in favour of his new structure. 
In 1900, Elie Cartan (1869-1951) published the classification of semisimple Lie 
algebras. However, Wilhelm Killing (1847-1923) had independently introduced 
Lie algebras with a different purpose since his interest was non-Euclidean ge- 
ometry. Lie collaborated for about a decade with Friedrich Engel (1861-1941). 
Their joint publication in 1893, Theorie der Tansformationgruppen appeared 
in three volumes, and perhaps best represents Lie’s major work on continuous 
groups of transformations. Engel was sent by Klein to study under Lie. Engel 
became Lie’s assistant in 1892 when Lie succeeded Klein for his chair at Leipzig. 
In 1898, Lie returned to Kristiana, the intermediate name taken by Christiana 
before it became Oslo. There he took a chair that had been specially created 
for him. However, he died shortly thereafter on February 18, 1899. 
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5.3 Ramification 


Everything is what it is, and not another thing. 


Joseph Butler (1692-1752) 
English bishop and theologian 


In this section we look at the following concept in extensions of number fields employing 
the notions presented in §5.2 


Definition 5.11 — Ramified and Unramified Extensions 


If K/F is an extension of number fields such that there does not exist a prime 9 x-ideal, 
which is ramified in K/F, then the extension is said to be unramified.°*® At the other end 
of the spectrum are those extensions for which there exists a prime D x-ideal P with 


€x/F(P) = [ic : F\, 


in which case the extension is called totally ramified, fully ramified, or purely ramified at P. 
If P is a ramified prime Dx-ideal with 


POAZ= (p), and pf ex/r(P), 


then ? is said to be tamely ramified in K/F. An extension K/F is said to be tamely 
ramified, provided that all ramified primes in K/F are tamely ramified. Thus, in particular, 
unramified extensions are tamely ramified. When p divides ex p(P), then P is called wildly 
ramified, and the extension is called wildly ramified at P. 


Corollary 5.9 If K/Q is an unramified extension, then K = Q. 


Proof. By Remark 3.14 on page 116, if K 4 Q, then |Ax/g| > 1. Therefore, there must 
exist a ramified prime in K/Q, by Corollary 5.8 on page 210. 


Remark 5.3 In view of Definition 5.11, the Fundamental Theorem of the Different, The- 
orem 5.11 on page 209, says that any tamely ramified prime O,-ideal P in K/F satisfies 
the property that 


pe-l | D/P, but Pet Dice where e = €x/F(P). 


Hence, if K/F is normal, then p” { Ax;r, where p = PM F, and n = |K : F|—see 
Exercise 5.20 on page 219. Later, we will see that the converse is also true, namely, that a 
normal extension for whichA x; is not divisible by the nt” power of a prime 9 p-ideal p 
must be tamely ramified at p—see Exercise 5.46 on page 253. 


Now we look at ramification in composita of number fields—see Application A.1 on page 325 
and the discussion surrounding it. 


5-8This includes the so-called infinite primes, namely the embeddings of F into C. This is the term used 
in class-field theory—see Theorem 5.21 on page 239. In an arbitrary extension K/F of number fields, a real 
embedding of F into C that extends to a complex embedding of K into C is said to ramify—see Exercise 2.11 
on page 63. Thus, these infinite “primes” that ramify must be excluded as well. We explore and develop 
the notion of these infinite primes in Exercise 5.24 on page 220. The primes that are not infinite are called 
the finite primes. 
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Theorem 5.12 — Ramification in a Compositum of Number Fields 


Let the number fields kK; for 7 = 1,2 be extensions of the number field F’, and let L = Kk, K2 
be the compositum of Ky and K2 over F. Then a prime O;-ideal p dividesA ,/p if and 
only if it dividesA Ki/FAK3/F- 


Proof. By Lemma 5.5 on page 198, any prime divisor ofA x, /7Ax,/p is a divisor ofA 7/7. 
Conversely, assume that the prime O-ideal p dividesA ;/p, and p{ Ax,/r. Thus, there 
exists a prime 0 -ideal P such that P | Dy/r and P lies over p. Since 


Dyjr = Dr/K,PK,/F; 


by part 2 of Lemma 5.4 on page 197, then P { Dx, /rOxK,. Thus, P | Dz/K,. Select a € Ox, 
such that Ky = F(a). Then, by Theorem 1.23 on page 38, 


MoF(t) = Me,K, (x) F(x), 
for some f(x) € Ky|x]. Therefore, 
mM, F(a) = My,K, (a) f(@). 


This implies that 
My, F(a) © Ma,K, (@)OK.- 


However, by Theorem 5.9 on page 203, 
My Ky (a) € DryK,, 


and since P | Dry/K,, then 
Dep ST: 


Therefore, mj, x, (a) € P, so mi, p(a) € P. Hence, by Theorem 5.9 again Dx, C P, so 


P | Dz/F, as required. 


Corollary 5.10 If K;/F is unramified for j = 1,2, then kK, K2/F is unramified. 


Proof. This is immediate from Theorem 5.12, and Corollary 5.8 on page 210. 


Corollary 5.11 Let F C K C L be a tower of number fields, where L is the smallest 
extension field of F containing K such that LZ is normal over F. Suppose that p is a 
nonzero prime Op-ideal. ThenA ;/;7 andA x, have the same prime divisors, so p is 
unramified in L/F if and only if p is unramified in K/F’. 


Proof. Let 6; for 7 = 1,2,...,n be all of the embeddings of kK into C. Then 
Doi emi 


the compositum of all the embeddings. The result now follows from Theorem 5.12, and 
Lemma 5.5. 


Remark 5.4 The above results set the stage for later when we develop the so-called Hilbert 
class field, which is the maximal, unramified, normal extension of a given number field such 
that the Galois group is abelian. This Galois group will be shown to be isomorphic to the 
class group of the base field via the celebrated Frobenius automorphism. The Hilbert class 
field is called the maximal abelian unramified extension of the base field. There is much 
power yet to be developed. 
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We continue with further results on composita of number fields and their discriminants. 


Theorem 5.13 — Discriminants and Degrees of Composita 
Let K; for 7 = 1,2 be number fields with 


gcd(Ax,, 4x3) = 1, 


and 
Then each of the following holds, where L = KK is their compositum. 
(a) |Z: Q|=nyznz. 


(b) DO, = Ox, Or,, and if {aj,...,Q@n,} and {61,...,6n,} are integral bases of Ay and 
Ko, respectively, then {a;8;} for 1 <i<mi,1< J < ng is an integral basis for L. 


(c) A, = AR Am, 


Proof. (a) We have 
|L: Q) = |L: Ko|-|Ko: Q| =|L: Ka|ne. 


If |Z: Q| < nyno, then |Z: Ke| < ny. Let K = Q(a). Then myx, (2) | Ma,Q(x). If F is the 
subfield of kK generated by the coefficients of ma,x,(x), then F £ Q. Since F' C Ko, then 
AF | Ax,, by Theorem 3.15 on page 126. Since the coefficients of ma,x,(x) are elementary 
symmetric functions—see Definition A.16 on page 333—of the roots of ma,x,(x), then 
Mea,Ko (x) € Ni [x], where Nj is the smallest Galois extension of Q containing K,. Therefore, 
F C Nj, so as aboveA pf | Ay,. Let p | Ar be a prime. Then p | An, and p | AxK,, SO 
v7) | Ax,, by Corollary 5.11, contradicting the hypothesis that gcd(A;, A) = 1. This is (a). 


(b) Since 0%, OK, is the smallest subring of DO, containing both Ox, and Ox,, then {a;{,;} 
for 1 <i<m,1<Jj< no, isa Z-basis for 0x%,0xK,. Therefore, 


disc ({ai8;}) => det (a9; (ax Be))*, (5.18) 


where the o; are the Q-isomorphisms of Ky and the 6; are the Q-isomorphisms of Kz. The 
determinant in (5.18) is the Kronecker product 


det(ai(am))°"? x det(0;(Be))°"* = Ap? AR 
—see Definition A.21 on page 339. Thus, by the very definition of a field discriminant given 
in Definition 2.7 on page 77, {a;6;} is an integral basis for L. 
(c) By Lemma 5.5 on page 198, 


Arq = Aig NR /2(ArjK,) = AB jg! AriK,): 


and similarly 


Ar/e = Ax, /0 


Hence,A {2 andA 7%, both divideA ;, and since gcd(Ax,,A4x,) = 1, then 


NPIL A ac): 


ne ny 
APNG 


Since disc ({a;8;}) = Al? AT}, by part (b), thenA ;, = AV? AZ. 
The following application of Theorem 5.13 fulfills the promise made at the top of page 126. 
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Theorem 5.14 — Discriminants of Cyclotomic Fields 


Let n € N, n > 2 and set K = Q(¢,), where ¢, is a primitive n*” root of unity. If 
n= j= p; for distinct primes p; and a; € N, then 


1) (7) /2n (nm) 


o(n/p;7 ( 
Ak = I aS ee _ ——_, 
Qe, Ge ) 3 pi )/ (p35 1) 


Proof. We use induction on r. Corollary 3.9 on page 125 establishes the induction step 
r = 1, so we assume the induction hypothesis, that the result holds for r — 1, where r > 1. 
Thus, by Corollary 3.9, and the induction hypothesis, 


ged(Ag¢,,) Aa(car)) = 1 


where n’ = n/p%. Therefore, by part (b) of Theorem 5.13, 


_TT adtr/es? alee") pate 
Ac = [Age 1 = a AC 


However, by the induction hypothesis, 


$(p2") (-1)? o(n')o(p Pr” (r—1)/2 (ny ‘) $(n')o(pr”) (—1) BOY“)? ay or) 


Qn) = = - iC Te 
fi 1 Gore 1))d(p2") Tint of iD 


and by Corollary 3.9, (or the induction hypothesis), 


ate) (<1) @PF/2)600") yr PIA (_yy oC /Aparolo) 
eM oc ~ ee 


Hence, by multiplying the last two expressions together, we get the final result. 


Corollary 5.12 A rational prime q is ramified in Q(¢,) if and only if ¢ 


Proof. This follows from Theorem 5.14 via Corollary 5.8 on page 210. 


We conclude this section with a result on prime decomposition, without ramification, in a 
cyclotomic extension. 


Theorem 5.15 — Prime Factorization in Cyclotomic Extensions 


Let K be a number field, and n € N. Set L = K(¢,), where ¢, is a primitive n“” root of 
unity. Suppose that p is a prime Ox-ideal with n ¢ p, and 


fxjo(P) = ex/o(p) = 1. 
If f € N is the smallest value such that pf = 1(mod n), where (p) = p/N Z, then 
pO, =P,---Po, 
where the P; are distinct prime D7-ideals with fr/~(P;) = f for each j = 1,2,...,n and 


fg =|L: Kl. 
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Proof. The extension L/K is normal since any K-isomorphism 6 of L satisfies 0(L) = L. 
Hence, we need only show that f,/«(p) = f, since once this fact is proved, the remaining 
facts fall into place as an immediate consequence of Theorem 5.4 on page 189. To see that 
p is unramified in L/K, use Corollaries 5.10 on page 214 and 5.12 on the preceding page. 


Claim 5.10 If fe, is the conductor of Ox[Gn] in Ox, then fe, | nO yp. 


First, we have that 
x" —1L=mg,,«(x)9(2), 


for some g(x) € Ox|ax]. Therefore, by taking derivatives and setting « = ¢,, we get 


nO = Oye (Gilgl(Gn)s 


Since ("-1 € Up,, then 
64/K (Gn)OL | nO yp. 


Thus, fc, | nO , by Lemma 5.6 on page 202, which secures Claim 5.10. 
By Claim 5.10, and by part (a) of Exercise 5.19 on page 219, 


NUKE ) | NEA, 


Therefore, since n ¢ p, then p { N4/*(f,,). This allows us to invoke Exercise 5.23 on 
page 219. Hence, for each y € Oz, there exists a polynomial k(x) € Ox [2] such that 


y=k(¢,) (mod pz). 

Thus, by the Binomial Theorem—see Corollary A.11 on page 341— 
f f f 
NOM = k(Gn)NO” = kG”) = (Cn) (mod pOz), 

where we are using Definition 2.8 on page 83 for the norm exponents. Thus, 

yO =m (mod), 
for each 7 = 1,2,...,g. By Exercises 4.30—4.31 on pages 163-164, the exponent m given by, 

(m) = NE/O(P;) = NEI p)fera 


is the smallest one such that 
y” =7 (mod P,), 


for all y € Oy and a given j = 1,2,...,g. Therefore, fr/x(p) < f. If 
N4/Q(p,)ft/K) #1 (mod n), (5.19) 
then G/"" 4, is a primitive n'” root of unity, and 
Cla ey, 
Hence, we have the basis discriminant containment: 
GiseC] Gagena ie) eee 


Thus, 
dise ({1, Cn. +. Gn }) € P| NZ = (p), 
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where the prime p does not divide n. Since 


disc ({1, Cn dors cae) = AQ(En) 


by Definition 2.7 on page 77, and Theorem 3.14 on page 123, then this contradicts Corol- 
lary 5.12 on page 216. Hence, the assumption (5.19) was incorrect, so 


N(P;)fasK®) =] (mod n), 


and f < fr/K(p), by the minimality of the choice of f. Hence, f = fr/K(p)- 


Corollary 5.13 Let K = Q(¢,) for n € N with n > 2. Suppose that p is a rational prime, 
with n = p*n’, where a is a nonnegative integer, p { n’, and f is the least natural number 
such that p/ = 1(mod n’), then 

pOx = Pi-++P,, 


where 
ex/o(p) = o(v") = 1, fg = o(n’), 
and all ?; distinct prime 0 x-ideals with 


fxjo(P;) =f= fx/o(p): 


Proof. If n = n’, namely when a = 0, then the result is an immediate consequence of 
Theorem 5.15 on page 216. If a E N, then let F = Q(¢,,). Therefore, 


|F : Q| = O(n’), and |K : Q| = 4(n), 


so 
|K : F| = $(n)/o(n’) = o(*) = |Q(pe) : QI), 
via Corollary 1.17 on page 41. By Theorem 5.15, 


por =Pi--- Py 

for distinct prime 0 p-ideals p;, 7 =1,...,g, and g = ¢(n’)/f. Moreover, 

pOK = (Pi---Pm)® (5.20) 
for some m,e € N. However, by Theorem 5.4 on page 189, 

mefix/q(Pj) = |K : Q| = on). 

Since p is not ramified in F'/Q, then e | |X : F| = d(p*). By Example 5.8 on page 190, 

p=ul— Cpa oe 
where u € Uzi¢.], 80 using (5.20), we get 

POK = (1— Cpe PPI OK = (Pro Pin)®, 


but e | o(p*), so e = $(p*) is forced. Since m > g, given that each p,; could decompose 
further in K/F, it remains to show that m < g. Since 


mo(p") fixja(P5) = O(n) = o(p")o(n’), 


5.3. 


then 
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mf Kjo(P;) = o(n’), 


but we also have that 


Hence, mfx/r(Pj) = g, so g > m, thereby completing the proof. 


fg = 9(n’), and fxje(P3) = fxsr(P,)f- 


In 85.4 we marry the Galois theory developed in §2.1 with the results developed thus far in 
this chapter to further develop the theory of decomposition of ideals in number fields. 


Exercises 


5.19. 


5.21. 


5.22. 


Let K/F be an extension of number fields, and let I, J be Ox-ideals. Establish each 
of the following. 


(a) If 2 CJ, then N*/" (Wy) CN (J): 
(b) If J and N¥/F(J)Ox are relatively prime, then N¥/"(I) and N¥/F(J) are 
relatively prime 0 p-ideals. 


(Hint: Use Corollary 1.7 on page 27.) 


. Let K/F be a normal extension of number fields, and let p be a prime 9D -ideal that 


is tamely ramified in kK. Prove that 


p’{ An;r, 
where n = |K: F]. 


(Hint: Use Theorem 5.5 on page 190, Theorem 5.11 on page 209, and part 3 of 
Lemma 5.4 on page 197.) 


Let K,/F for 7 = 1,2 be an extension of number fields, and let L = K)K2 be their 
compositum. Prove that 
DrjK, | Dky/rOc- 


(Hint: Use Theorem 5.9 on page 203.) 


Let K,/F for 7 = 1,2 be an extension of number fields, and let L = K,K2 be their 
compositum. Prove that 


LK 
N?@/F (Ay 7x,) | ae 


and 


L:K. 
N¥1/F (Ar ix) | Neg 


(Hint: Use Exercise 5.21 in conjunction with Exercise 5.6 on page 195.) 


. Let L/K be an extension of number fields with L = K(a) for some a € D;. Suppose 


that p is a prime O,-ideal such that 
pt n?/F (f,). 
Prove that for any y € Or, there exists a k(#) € Ox [x] such that 
7 = k(a) (mod pz). 


(Hint: Use Theorem 1.21 on page 82.) 
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In the next exercise, we develop the notion of an infinite prime first mentioned in Foot- 
note 5.8 on page 218. To do so we make use of valuation theory a complete overview of 
which may be found in [54, Chapter 6]. 

First of all, an absolute value on a field F is a function |-|: FR satisfying each of the 
following. 


(a) |z| > 0 for all x € F and |z| = 0 if and only if « = 0. 
(b) |z- y| = || - ly] for all a, y € F. 
(c) |x +y| < |x| + |y| for all x,y € F. (Triangle inequality) 
If the triangle inequality can be replaced by the condition 
jc + y| < max{|z|,|y|} for all a,y € F, (5.21) 


then the absolute value is said to be a non-Archimedean valuation, and otherwise it is called 
an Archimidean valuation. 


Two valuations |x| and |x|, are said to be equivalent if |x| < 1 holds if and only if |x|, < 
1, which is an equivalence relation—see Exercise 1.8 on page 6. An equivalence class of 
valuations on a field F is called a prime of F’, denoted by p, with the valuation in p denoted 
by |-|p and its value at x denoted by |\x|p. An equivalence class of Archimedean valuations 
is called an infinite prime of F' and an equivalence class of non-Archimedean valuations is 
called a finite prime of F. 

If p is an infinite prime of F and @: F ++ C is an embedding of F into C such that |6(a)| 
is in p and @ is a complex embedding, then p is called a complex prime, and if @ is a real 
embedding, then it is called a real prime—see Exercise 2.11 on page 68 for the definitions 
of real and complex embeddings. 

If K/F is an extension of number fields then extensions of p to primes of K are described 
as follows. By Exercise 2.6 on page 63, 0 extends to exactly g = |K : F| F-isomorphisms 
B1,---, PB, of K, which are infinite primes of K and that are the extensions of p. To be 
consistent with the finite case we write 


p= 21 --- Poy. 


5.24. Let K/F be an extension of number fields and p be an infinite prime of F' with 
B1,--- , PB, the primes of K that extend p. Let the ramification number e; = ex, (Bi) 
equal 2 if p is real and $B; is complex and e; = 1 otherwise. Set fj; = F/r(Pi) = 1 in 
all cases. Prove that 


g 
bird = |i : F|. 
i=1 


In the remaining exercises, we provide applications of the above-defined valuations. 


5.25. If F is a field and a,8 € F with |a| < |6| for a non-Archimedian valuation | - |, prove 
that |a+ | = |6|. 


(This says that, with respect to |-|, every triangle ts isosceles.) 
5.26. Suppose that F is a field with a non-Archimedean valuation |-|. Prove that the 


valuation of F can be extended to the polynomial ring F[a] by defining the absolute 
value of f(x) = a9 + a,%+-+-an2” to be | f| = max{|aol,..., |an|}- 
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5.4 Galois Theory and Decomposition 


Trivial personalities decomposing in the eternity of print. 
Virginia Woolf (1882-1941) 
English novelist 


We begin with an illustration of a Galois extension as a motivator for an important concept. 


Example 5.12 Let K = Q(¢35) and F = Q(¢s). Then K/Q is a Galois extension, and 
H = (co) given by 
o:GHG and a:¢5 + ¢5 


is a subgroup of Gal(K/Q) with fixed field Q(¢5). Notice that any rational prime p = 1 
(mod 5) is completely split in Q(¢s) by Corollary 5.13 on page 218. 


Example 5.12 motivates the following. 


Definition 5.12 — Decomposition Groups and Fields 


Let A/F be a Galois extension of number fields with Galois group Gal(K/F), and let P be 
a prime Ox-ideal. Then 


Dp(K/F) = {o € Gal(K/F) : P? = P} 
is called the decomposition group of P in K/F. The fixed field of Dp(K/F), 
Zp(K/F) ={8€ K: B° =B6 for allo € Dp(K/F)}, 


is called the decomposition field of P in K/F. When Gal(K/F’) is abelian, then the decom- 
position group and the decomposition field depend only on p = PN Yr, so in this case, we 
denote them by 

Dy(K/F) and Zp(K/F), 


and call them the decomposition group of p, and the decomposition field of pin K/F. When 
Gal(K/F’) is abelian, we say that K/F is an abelian extension. 


We begin with a fundamental result on decomposition groups. 


Lemma 5.7 — Conjugacy of Decomposition Groups 


Let K/F be a Galois extension of number fields, and let P be a prime Dx-ideal. Then for 
all o € Gal(K/F), 
o 'De(K/F)o = Dpo(K/F). 


Proof. Let T € Dp(K/F) and o € Gal(K/F). Then 
(P7)7 7 = PT? =P, 
Therefore, o~'ta € Dpo(K/F). Hence, 


o 'Dp(K/F)o C Deo(K/F). 
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It remains to verify the reverse inclusion. If y € Dp-(K/F’), then 
p77 — f° which implies that P77” | = P. 
Thus, cya! € Dp(K/F). In other words, y € o-'D(K/F)o, so 


Depo(K/F) Co 'De(K/F)o, 


as required. 


Example 5.13 In Example 5.12 on the preceding page, the decomposition group of any 
rational prime p = 31(mod 35) in the abelian extension Q(¢35)/Q is 


Dp (Q(435)/Q) = Gal(Q(¢35)/Q(¢s)), 


and the decomposition field of p is 


Zp(Q(G35)/Q) = Q(Gs). 


Remark 5.5 Lemma 5.7 on the previous page shows that if K/F is a Galois extension of 
number fields, and P is a prime 9 x-ideal, then for any o € Gal(K/F), 


ao 'Dep(K/F)o = Depo (K/F). 


This is the group-theoretic analogue of the fact established for prime ideals, Corollary 5.1 
on page 190, namely that the prime D,-ideals are transitively permuted by the elements 
of Gal(K/F). In other words, if p is a prime D-ideal with 


g 
pOx = [] 9%, 
j=l 


then the decomposition groups Dp,(K/F) for 1 < j < g are transitively permuted by the 
elements of Gal(K/F’). In the case where K is an abelian extension of F’, then 


Do, (K/F) = Do, (K/F) = Dp(K/F), 


for all natural numbers j,k < g. In other words, in the abelian case, the decomposition 
groups are all the same, thereby justifying the penultimate remark made in Definition 5.12 
on the previous page for the use of the notations D,(K/F) and Z,(K/F). 


The decomposition field is aptly named, as shown by the following. 


Theorem 5.16 — Splitting in the Decomposition Field 


Let K/F be a Galois extension of number fields, and let P be a prime Dx-ideal with 
POOr =p. Then for Z = Zp(K/F), 


|K : Z| = |Dp(K/F)| = exr(p) fx/r(P), 


and if PN Dz = Pz, then 
fajr(Pz) = ezjr(Pz) = 1. 
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Proof. By Theorem 2.4 on page 60, 
|Gal(K/F) : Dp(K/F)| = |Zn(K/F) : Fl. 


By Lemma 5.7, each right coset Dp(K/F)o of Dp(K/F) via o € Gal(K/F’) sends P to P7. 
In other words, if y € Dp(K/F)o, then 


Pua Prt = pe. 


for allt € Dp(K/F). Therefore, Dp(K/F)o = Dp(K/F)r for o,r € Gal(K/F) implies 
that Dp(K/F) = Dp(K/F)ro~', so P + PT? *. In other words, P? = P7. Therefore, 
we have established a one-to-one correspondence between the right coset Dp(K/F)o in 
Gal(K/F) and the primes P?. By Corollary 5.1, these primes are transitively permuted by 
the o € Gal(K/F), so there must exist gx /(p) of them. Hence, 


|\Zp(K/F) : F| = gx/r(p)- 
Thus, by Theorem 5.4 on page 189, 
|Dp(K/F)| = exjr(P) fxr (p)- 


Now we verify the last statement in the theorem. Let Z = Zp(K/F), and Pz =PNZ. By 
Theorem 2.4, K/Z is a normal extension. Therefore, 


Gal(K/Z) = D9(K/F), 
so P? = P for all o € Gal(K/F). By Theorem 5.4, gx/z(Pz) = 1, and 
|K: Z| = exjz(Pz)fxjz(Pz). (5.22) 


Also, 
|K: F| = exyr(p) fer (P)gn/r(P), (5.23) 


and we have already shown that 
|Z: F| = 9x/F(p). (5.24) 
Hence, putting (5.22)—(5.24) together, we get 


ex/F(P) fx/F(P)9xK/F(P) = ex/z(Pz) fxjz(Pz)9K/F(P), 


sO 


ex/F(P) fx/r(P) = ex/z(Pz)fxjz(Pz). (5.25) 
However, by Theorem 5.1 on page 184, 
ex/F(P) = ex/z(Pz)ez/r(Pz), (5.26) 
and 
fer (P) = fjz(Pz) fa/r(Pz). (5.27) 


By comparing (5.25)—(5.27), we get 


ezjr(Pz) =1= fzsr(Pz), 


as required. 
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Corollary 5.14 If K/F is a Galois extension of number fields, and P is a prime D x-ideal 
with PO Or =p, then 

|Zp(K/F) : F| = gx/r(P)- 
Furthermore, if Dp(K/F) is a normal subgroup of Gal(K/F), then p is completely split in 
Zp(K/F). 


Proof. From the proof of Theorem 5.16, we have the first statement. By Theorem 2.4, if 
D>(K/F) is normal in Gal(K/F), then Z/F' is a normal extension where Z = Zp(K/F), 
so by Theorem 5.16, 


fajr(Pz) = fasr(p) =1=ezr(Pz) = ez/r(p), 
where Pz = PM Z. Therefore, 
92/F(P) =|Z:F|= 9K/F(P); 
namely p is completely split in Z. 


Example 5.14 If we let L = Q(¢3, 72), and F = Q, then Gal(K/F) = $3, the symmetric 
group on three letters—see Definition A.1 on page 320. In Example 5.9 on page 191 we 
demonstrated that p = 29 splits into two primes 


pOK = PyP2 


where kK = Q(¥V2), with fxjo(P1) a= €x/a(P;) for j = 2s and fxjo(P2) = 2: Also, 
fr/K (21) = 2, where Q; is the prime O;-ideal over P,;. Thus, 


Zo, (L/Q) = K, 


which is not normal over Q, as demonstrated in Exercise 2.12 on page 63. Similarly, the 
decomposition fields for 22 and Q3 are, respectively, 


Q(¢s V2) and Q(G3 V2). 
In none of these (isomorphic) fields is 29 completely split, since 
|Z0, (L/Q)| = 3 = gx/r (29), 


but fx/o(P;) < 2 for 7 = 1,2. This shows that the normality assumption in Corollary 5.14 
is indeed necessary. 


There exists another important subgroup of the Galois group from the perspective of de- 
composition. The reader unfamiliar with residue classes modulo an ideal should review 
Exercises 4.30—4.32 on pages 163-164 before proceeding. 


Definition 5.13 — The Inertia Group and Inertia Field 
Let K/F be a Galois extension of number fields, and let P be a prime Ox-ideal. Then 
Tp(K/F) = {0 € Gal(K/F):a° =a (mod P) for all a € Dx} 
is called the inertia group of P in K/F, and its fixed field, 
Tp(K/F) ={8 € K: 8° =6 for all o € Tp(K/F)}, 
is called the inertia field of P in K/F.°° 


5.9The T is used for inertia subgroup since it comes from the German Trdgheitskorper, and, similarly, 
Z for the decomposition field comes from Zerlegungskérper. These were the terms used by Hilbert in 
his Zahlbericht, where the theory was published for the first time. However, there is a certain consensus 
that Dedekind knew about the decomposition and inertia subfields, as shown by his papers, which were 
unpublished at the time that Hilbert wrote down his ramification theory. 
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Lemma 5.8 — Inertia and Conjugacy 


Let K/F be a Galois extenion of number fields, and let P be a prime O-ideal. Then, for 
all o € Gal(K/F), 
o'Tp(K/F)o = Tpo(K/F). 


Proof. If tr € Jp(K/F) and o € Gal(K/F), then for a € K, 
ota ‘(a) —a=art(a*(a)) — aa *(a) = a (t(o (a) — a *(a)) € o(P), 
so oT p(K/F)o—' C Top) (K/F). By the same reasoning, 


oT, (9)(K/F)o C T(K/F), 


so we also have the reverse inclusion. 


The following gives a value to the order of the inertia group. 


Theorem 5.17 — Index of the Inertia Group 


Let K/F be a Galois extension of number fields, and let P be a prime 0x-ideal with 
PAOr =p. Then Jp(K/F) is a normal subgroup of Dp(K/F), and 


|Gal(K/F) : Te(K/F)| = fxr (p)9K/F(P)- 
Also, for T = Tp(K/F), and Pr = PN Or, we have that fxjr(P) = 1, er/r(Pr) = 1, and 
ex/T(P) = ex/r(p). 
Proof. Let Ox /P = Kp and Or /p = Fy. Define a mapping 
w: Dp(K/F) 6 Gal(Kp/F,), 
by 
vo) =o, 


where o(@) = oa, with @ being the residue class of a in Ox /P. Thus, o € Gal(Kp/F,), 
and 7 is a homomorphism. By the definition of Tp(A/F'), we get that ker(w) = Tp(K/F), 
so by Theorem A.5 on page 328, 


Dp(K/F)/Te(K/F) = Gal(K9/F,), 


and J(K/F) is a normal subgroup of D»(k/F')—see also Lemma 5.7 on page 221. Since 
Theorem 5.16 on page 222 gives us that |Dp(K/F)| = ex/r(P)fxsr(P), and by Defini- 
tion 5.1 on page 182, |Kp : Fy| = fxsr(P), then 


|Tp(K/F)| = exr(P), 


|Gal(K/F) : Te(K/F)| = fxjr(P)9K/r(P), 


which is the first result. Next, we show that fx/r(P) = 1. Let Tp, = Or/Pr. By the 
definition of inertial degree, we need only show that 


|Kp:Tp,| = 1. (5.28) 
To show this, we demonstrate that if 6 € Ke, then 


f(x) = (@ — B)® € To, [a], 
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where e = |Jp(K/F)| =|K : Tp(K/F)|. Once shown, then every element of Gal(Kp/T>.,) 
sends 6 to a root of f(x), namely £ itself, so Gal(Kp/T>p,,) is trivial and (5.28) holds. 


Let a € Kp. Then 
g=)= Jl @-a’)€ 97x]. 
o€T(K/F) 


By reducing coefficients modulo P, we get that g(a) € Ke[az], so by the definition of 
Tp(K/F), g(x) € Tp, [x], and a? =@ so 


and we have verified (5.28) as required. 
Now we show that er/p(Pr) = 1. Since we have shown above that 


|T.: F| = |Gal(K/F) : Tp(K/F)| = fi/p(P)g9K/r(P), 


then er/F(Pr) = 1, and since fx/r(P) = 1; then ex/F(P) = ex/F(P)- 


Maintaining the notation and assumptions of Theorem 5.17, we have the following conse- 
quence. 


Corollary 5.15 For any Galois extension K/F, we have 
|T9(K/F)| = exyr(p), 


and if Dp(K/F’) is a normal subgroup of Gal(K/F’), then each of the gx /p(p) prime Oz- 
ideals is inert in T where Z = Zp(K/F), and each prime P-ideal is an e'” power in 
K. 


Proof. From the proof of Theorem 5.17, |Jp(K/F)| = ex;r(P). By Corollary 5.14 on 
page 224, there exist g = gx /r(P) prime Z = Zp(K/F)-ideals above p. Hence, there exists 
exactly one prime 0 x-ideal above each of the g prime  z-ideals. Thus, the inertial degrees 
of each of the g prime Z-ideals in T is the same. To prove that each prime 9 7-ideal is 
inert in 7’, it suffices to prove that each is unramified in T. However, from Theorem 5.17, 
er/r(Pr) = ez/r(Pz) = 1. The result now follows from Theorem 5.4 on page 189. Hence, 
in consideration of the above results, PrO x = P°, where e = ex /p(p)- 


In the following, an intermediate field in the extension K/F means an extension field of F 
contained in K. 


Corollary 5.16 — Intermediate Fields as Decomposition and Inertia Fields 
Let K/F be a Galois extension of number fields, and let P be a prime Dx-ideal. Then 
(a) If L is an intermediate field, then ZL is the decomposition field of P in K/L. 

(b) If ZL is an intermediate field, then LT is the inertia field of P in K/L. 


Proof. (a) Let Fy = Opz/Pi where Py = PN OZ, Fo = OL/P2 where Pp = PN Oz, 
F3 = Oz/P3 where P3 = PN Oz, and Fy = Oraz/Pa where Pg = PN Ozaz. Then 
Gal(F/F2) may be embedded into Gal(F3/F 4) via restriction of automorphisms. However, 


Gal(F3/F4) = |F3 : Ful = fzytaz(P3) = 1, 


by definition, so fpz/,(P1) = 1. Furthermore, Pz cannot split any further by Theorems 
5.16-5.17. Hence, LZ is the decomposition field of Pz in K/L. 

(b) This is proved in a similar fashion to that given part (a), by comparing the Galois 
groups of LT/L and T/(LNT). 
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Corollary 5.17 Let K/F be a Galois extension of number fields, and let P be a prime 
9 -ideal. Then each of the following hold. 


(a) Zp(K/F) is the smallest intermediate field L such that P is the only prime 9 x-ideal 
lying over Pp = PNOD_y. 


(b) The field Zp(K/F'’) is the largest intermediate field L such that 
et/F(Pr) are fryr(Pr) =1. 
(c) The field Tp(K/F) is the largest intermediate field L such that 
er/F (Pr) =1. 
(d) The field Tp(K/F) is the smallest intermediate field Z such that 
ex/i(P) =|K :L. 
Proof. (a) Suppose that P is the only prime Ox-ideal lying over Py. Since Gal(K/L) 


transitively permutes the prime D,-ideals above P; by Corollary 5.1 on page 190, then 
Gal(K/L) is forced to be in Dp(K/F). Thus, by Theorem 2.4 on page 60, 


which establishes (a). 


(b) If 
er/r(PL) = frsr(Px) = 1, 


then by Theorem 5.1 on page 184, 
ex/e(P) = exyt(Persr(Pr), 
and 
fpr?) = ft?) fryr(Pr). 
Therefore, 
ex/F(P) = exyt(P) and fxr(p) = fx/z(P). 
Thus, for Z = Zp(K/F), by Theorem 5.16 on page 222, 


ex/F(P) fxsr(P) =|K: Z|. 


Also, 
|: ZL| = exyz(P)fx/x(P), 
since the decomposition field of Py in K/L is ZL, by part (a) of Corollary 5.16. Thus, 
Z = ZL, 80 L C Z, which is (b). 
(c) If er;r(PL) = 1, and T = Tp(K/F), then by part (b) of Corollary 5.16, LT is the 
inertia field of P in K/L. Thus, 
exjr(P) =|K :T| =|K: LT| =exz(P). 


Hence, T = LT, so L C T, which verifies (c). 
(d) If Py is totally ramified in K, then by part (b) of Corollary 5.16, 


Therefore, LT = L so T C L, which completes the entire result. 
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Corollary 5.18 Let K/F be a Galois extension of number fields, and let P be a prime 
Ox-ideal with PA Or =p. Then if Dp(K/F) is normal in Gal(K/F), p is completely split 
in an intermediate field L if and only if L C Zp(K/F). 


Proof. If p is completely split in L/K, then ep;r(Pi) = fryr(Pt) = 1, where Pp = PNO_. 
Therefore, by part (b) of Corollary 5.17, L C Zp(K/F). Conversely, by Corollary 5.14 on 
page 224, p is completely split in Zp(K/F), so a fortiori it is completely split in L. 


Diagram 5.1 — Inertia, Ramification, and Decomposition 


Let K/F be a Galois extension of number fields, and let P be a prime Ox-ideal. Then 
the following illustrates the theory developed above. In what follows, Kp = Ox/P, T = 
Tp(K/F), Pr = PNT, Tp. = Or/Pr, LS Zp(K/F), Pz = PNZ, LP y = Oz/Pz, 
p=PNF, and Fy = Or/p. 

Primes Groups Fields Degrees Residue Fields 


P K Kp 
Jo(K/F) ex/r(P) 
Pr al T Tp, 
Do(K/F) fxjr(P) 
Pz ‘all Z Lez 
Gal(K/F) ! 9K/F(P) 
p Fi Fy 


The above diagram is augmented by the following one that motivates an important concept. 


Diagram 5.2 — Residue Class Fields and Their Global Counterparts 

Let K/F be a Galois extension of number fields, and let P be a prime Dx-ideal with 
Kp = Ox/P. Every element of Gal(K/F’) restricts to an automorphism of Ox. Thus, if 
oa € Dp(K/F), there is an induced mapping =): On 4 K> with ker(q) = P. Therefore, 
each o € Dp(K/F) induces an automorphism @ of Kp in a fashion such that the following 
diagram commutes.°?:!° 


On —> OK 


i. a 


Yea es 4 


Also @ fixes the finite field Or /p = Fy, where p= PNOp. Hence, o € Gal(Kp/F,), so this 
yields a mapping 

p: Dp(K/F) + Gal(Kp/F,), 
which is a group homomorphism since products in Dp(K/F) correspond to products in 
Gal(Kp/F,). Also ker(p) = Tp(K/F), so Tp(K/F) is a normal subgroup of Dp(K/F')—see 
Exercise 5.43 on page 253 for a generalization of this fact. This tells us that the quotient 


5.10\We remind the reader that a commutative diagram, in this case, means that we have the equality of 
composite maps woo=cor. 
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group Dp(K/F)/Tp(K/F) is embedded in Gal(Kp/F,). A fundamental fact, which is 
buried in the proof of Theorem 5.17 on page 225, is that p is an epimorphism, so 


De(K/F)/Tp(K/F) & Gal(Kp/Fp). 


From Exercise 2.16 on page 64, the Galois group Gal(p/Fy) is cyclic of order fx/p(P). If 
? is unramified in K/F’, then by the aforementioned proof, 


(Tp) = Gal(Kp/Fy) = De(K/F), 


and there is a unique op € Dp(K/F) such that op + op. The generator of the decom- 
position group in this case is a very distinguished element, which is named as follows—see 
Biography 2.3 on page 80. 


Definition 5.14 — The Frobenius Automorphism 


If K/F is a Galois extension of number fields, and P is a prime O,x-ideal unramified in 
K/F with PO Or =p, then Dp(K/F) is cyclic and has generator: 


K/F 
Pp c 
called the Frobenius automorphism of P in K/F, given by 


When Gal(K/F’) is abelian, then the Frobenius automorphism depends only on p and we 
write Por 


where as usual pO x is the product of the prime 9 x-ideals lying over p. In the abelian case, 


(4) is also called the Artin symbol—see Remark 5.7 on page 239. 


Definition 5.14 allows us to state one final consequence of Theorem 5.17. 


Corollary 5.19 Let K/F be a Galois extension of number fields, with P a prime Ox- 
ideal. If P is unramified in K/F, then Dp(K/F) is cyclic of order fx/r(p) generated by 
the Frobenius automorphism of ? in K/F’. In particular, P is completely split in K if and 


only if (4) =1. 


Application 5.1 —The Frobenius Automorphism on Cyclotomic Galois Groups 


Let ¢, for n € N be a primitive n“” root of unity, and set K = Q(¢,). We now apply the 
Frobenius automorphism to show that Gal(A/Q) is isomorphic to (Z/nZ)*, the multiplica- 
tive group of nonzero elements of Z/nZ. 

Any o € Gal(K/Q) is determined by its action on ¢,, namely ¢7 = ¢"-, where ng € Z is 
uniquely determined modulo n. Also, this action is independent of the choice of ¢,, since a 
acting on any primitive n“” root of unity raises it to the power nz, given that all roots of 
unity are powers of ¢,. Thus, if 0,7 € Gal(K/Q), then 


(her = 27 = ((te)T = creme, 
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Thus, non; = 57 (mod n). In other words, the mapping defined by 
a € Gal(K/Q) ng € Z/nZ", 


is a homomorphism. Since each o uniquely determines n,, then this map is a monomor- 
phism. It remains to show that it is an epimorphism. Let p { n be a rational prime. 
Therefore, the Frobenius automorphism 


(82) 


is well-defined since p is unramified in kK. If P is any prime Ox-ideal over p, then this 
automorphism is given by 
a"’er =a? (mod P). (5.29) 


From Exercise 1.54 on page 43 it follows that if f(a) = 2" — 1, then 


[]@-) =f @), 


k 


where the product runs over all nonnegative k <n-—1 with p{k. Since f’(¢?) = npr) 
then [],,(¢2 — ¢*) ¢ P. Hence, a"’» = a? (mod P) represents the residue class of p modulo 
n. In other words, the mapping g +} ng is an isomorphism of Gal(K/F'’) onto Z/nZ*. 


We now illustrate how the Frobenius automorphism can be used to prove Gauss’s quadratic 
reciprocity law. 


Application 5.2 — The Quadratic Reciprocity Law via Frobenius 

Let K = Q(¢,) where p > 2 is prime and ¢, is a primitive p‘” root of unity. Set p* = 
(—1)®-)/*y. Then by Exercise 5.35 on page 232, Q(./p*) = F is a quadratic subfield of 
K. In fact, it is the unique quadratic subfield of K, since Gal(K/Q) is cyclic of order p—1, 
given that it is generated by o where o(¢») = ¢% with g being a primitive root modulo p. 
By Application 5.1 on the preceding page, Gal(#’/Q) corresponds to the subgroup F%, of 
nonzero elements of the field of p elements, F,. Hence, if gq 4 p is any odd prime, and 


(8) 


is the Artin automorphism of g in K/Q, then its restriction to F, 


r= (28) -(22), 


q 
is the identity on F’ precisely when oq : ¢p +> G7, where q is a square in F}. Otherwise, it 
is the nontrivial automorphism, with q being a nonsquare in F>. Thus, by considering the 
natural identifications: 


F 


(22). (3) cm 


we get 
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by the very definition of the Legendre symbol. From another perspective, since F = Q(,/p*), 
then q splits in F if and only if 


and q is inert exactly when 


so for odd q, we get 


(22)-(2) om 
()-()-@)""@) 


However, 
So 


which is Gauss’s Quadratic Reciprocity Law. 
Application 5.2 also suggests how a rational prime splits in a cyclotomic field. 


Application 5.3 — Frobenius and Splitting in Cyclotomic Fields 


Suppose that n € N, n > 1, and without loss of generality n 4 2(mod 4), since K = Q(¢,) = 
Q(¢2n) for n odd by Corollary 1.17 on page 41. Then by Application 5.1 on page 229, 


G = Gal(K/Q) © (Z/nZ)*. 


(9) 


is defined for p{n, and depends only on p since G is abelian. Thus, a7? = a? (mod pO x), 
for alla € Ox = Z[¢,], by Corollary 5.13 on page 218. Hence, p is completely split in K 
if and only if p = 1(mod n), which is tantamount to saying that o, = 1, namely a = a? 
(mod px) for all a € Oxn—see Exercises 4.31—4.32 on page 164. 


Thus, 


Exercises 


5.27. Let Fy the finite field of g = p? elements for some prime p. A map x from Fj to the 
multiplicative group of roots of unity in C* such that 


x(ab) = x(a)x(b) for all a,b € FG 


is called a (multiplicative) character on F*.°'* If x(a) = 1 for all a € F%, then x is 
called the trivial character on Fj, denoted by e. It is convenient to extend the domain 
of definition from Fj to F, by setting y(0) = 1 if x = «, and x(0) =O if x Ae. The 
order of x is the least m € N such that ,” =e. Establish each of the following. 


5-11Notice that the Legendre symbol (3) is an example of a character on F, by considering Gp) as a coset 
of a modulo p—see Exercise 5.33. 
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(a) x) =1. 

(b) x(a)¢~* =1 for alla € F¥. 

(c) x(a~*) = x(a)~! = x(a) for all a € F%, where x(a) is the complex conjugate of 
x(a). 


Exercises 5.28-5.382 will be with reference to characters as defined in Exercise 5.27. 


5.28. Let y be a character on F,. Prove that 


q-1 : 
. | 0 ify #e, 
Sew-{ ° ify =e. 


j=0 


5.29. Prove that the characters on F{ form a multiplicative group, denoted by €h(F;), 
via the definition of multiplication and inverses given by yA(a) = yx(a)A(a), and 
x7 (a) = (x(a))~!, for a € F% and characters y and 4. 


5.30. Prove that €h(F), given in Exercise 5.29, is cyclic of order g— 1 and that if a € Fj 
with a # 1, there exists a character y on F, such that x(a) 4 1. 


Henceforth, if x is a character on F{, then x is said to be of order n, where n | q-1, 
provided that n is the smallest such value for which x” = e. 


5.31. Suppose that a € Fj with a # 1. Prove that 


S> x(a) =0. 


xECh(FF) 


5.32. Suppose that a € Fj, andn € N with g= 1(mod n) such that x” = a has no solution 
for any x € Fy. Prove that there exists a character ~ on F, of order n such that 


x(a) # 1. 


5.33. For an odd prime p, let ) denote the Legendre symbol with (3) = 0 for convenience, 
and for k € Z, set 


P 


5.34. With reference to Exercise 5.33, prove that G?(1) = (—1)°="p. 
5.35. Let p > 2 be a prime, and set p* = (—1)®-)/*p. Prove that®-!? Q(V/p*) C Q(G). 


5.36. Let p 4 q be rational primes with p odd, and let d be a fixed divisor of p— 1. Prove 
that g = «4(mod p) is solvable for some x € Z if and only if g is completely split in 
the unique subfield of Q(¢,) having degree d over Q. (Observe that Gauss’s Quadratic 
Reciprocity Law follows from this, the case where d = 2—see also Application 5.2 on 
page 230.) 


5.12T Chapter 5, we will generalize this result considerably with a proof of the celebrated Kronecker-Weber 
Theorem (see Theorem 5.23 on page 244). 
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5.5 Kummer Extensions and Class-Field Theory 


I don’t like people who have never fallen or stumbled. Their virtue is lifeless and it 
isn't of much value. Life hasn't revealed its beauty to them. 

From Part 2, Chapter 13, Section 12 of Doctor Zhivago (1958) 

Boris Pasternak (1890-1960) 

Russian novelist and poet 


In this section, we commence with another type of extension distinct from the quadratic 
and cyclotomic extensions considered in §5.4, which will lead us into class-field theory that 
is an aspect of “higher algebraic number theory.” 


Definition 5.15 — Kummer Extensions 
Let F be a number field containing a primitive n‘” root of unity for a given fixed n € N, 
and set f(a) = 2" — a for a given a € F. Then K = F(#/a) is called a Kummer extension 


of F, where */a is a root of f(z). 


Lemma 5.9 — Kummer Extensions are Cyclic 
If kK = F(%/a) is a Kummer extension of F’, then K is a normal extension of F’ and 
Gal(K/F) is cyclic of order n. 


Proof. Let €; = (} for 7 = 1,2,...,n be all of the n“” roots of unity in F, where ¢, is a 
primitive nth root of unity. If o is an F-isomorphism of K, then 


a: Yare VaeF, 


for some j = 1,2,...,n, which is another root of «” —a. Thus, K/F is a normal extension. 
If o,,0¢ € Gal(K/F), are given by o;(%/a) =; v/a for j = k,£, then 


onoe( %/a) = on (Ee Va) = Exner Va = Even Va = ovo ( /a), 


so Gal(K/F) is abelian. Select o € Gal(K/F), such that ¢ : Ya ¢, </a, where ¢; is 
a primitive nth root of unity. Then o” = 1 but o” 4 1 for any natural number m < n, 
because ¢7" # 1 for any such m, so o generates Gal(K/F’). In other words, Gal(K/F)) is 
cyclic of order n. 


Theorem 5.18 — Decomposition in Kummer Extensions 


Let K/F be a Kummer extension of degree n with K = F'(8) such that 6° =a € Or. If p 
is a prime 0 -ideal such that na ¢ p and g is the maximal divisor of n such that 


xi =a (mod p), 


has a solution in x € Op, then 
pOrK Pye Pas 


for distinct prime 0 x-ideals P;, 1 < 7 < g, and 
fg=|K: Fl, 
where f is the minimal exponent such that 


B! =a (mod p). 
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Proof. Let F, denote the field O-/p, and let % denote the image of x in Fy under the 
natural map from Of. 


Claim 5.11 If f € N is the smallest exponent such that 


Bp =aeF,, 


then m(x) = af — @ is irreducible over Fy. 


Since B” € F,, thenn > f. Ifn = fqtr where gq ¢ Nand0 <r < f, thena” = a”-f4 ¢€ F,, 
so r = 0, by the minimality of f. Thus, f | n. Since a primitive n-th root of unity, Gr € Fp, 


then ¢2/f = Cr € Fy. Also, 
-a= 1 x — BC}). 


If g(x) properly divides (x/ —@) for some g(x) € Fy [2], then g(0) = Bey € F,, where y € Fy 
and k < f. Hence, by the minimality of f, we must have that k = f, a contradiction. This 
establishes Claim 5.11. 


Let Kp denote the field D%/P where P is a prime Ox-ideal over p. Since a root of m(x) 
generates the field extension Kp/F,, then 


faye (?) = fxyr(P) = |Ko : yl = f, 


where the penultimate equality comes from Claim 5.1 in the Proof of Theorem 5.3 on page 
187, and the last equality comes from the fact that 


Fy [x] 
(m(x)) 


which is a result of (A.3) on page 325. Hence, 


Kp = = F,(8), 


Kp: Fy] = deg p, (m) =f. 
Since na ¢ p, then p is unramified in kK. Thus, by Theorem 5.4 on page 189, 
g=|K: F\/f. 
Since f is the minimal divisor of n such that 
x! = 8 (mod p), 
has a solution x € Op, then g is the maximal divisor of n such that 
«7 = 6 (mod p) 


has a solution in Dp, so 
pOrK = P1--- Poy, 


and this secures the proof. 


A special case of Theorem 5.18 is worth isolating, especially in view of the fact that this 
will be one of the stepping stones in concluding Kummer’s proof of FLT for regular primes, 
which we will see in Theorem 5.22 on page 240. 
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Corollary 5.20 Suppose that p is a rational prime and F' is a number field containing a 
primitive p*” root of unity. Then if a is not a p’” power of an element of Op, x? — a is 
irreducible over F and Gal(/F) is cyclic of order p. In the latter case, one of the following 
two events occurs for any prime 0 -ideal q, where pa ¢ q. 


(a) The congruence 
x? =a (mod q) (5.32) 


has a solution, in which case gx /r(q) = p, and fx;r(q) = 1 = ex;r(q), namely q is 
completely split in Kk. 


(b) The congruence (5.32) has no solution, in which case fx/r(q) = p, and gx/r(q) = 
1 = ex /r(q), namely q is inert in K. 


We need the following result for the ensuing development. 


Lemma 5.10 Let K = F(%/a), where a € Op is not ap” power in Op, ¢ € F, and p is 
a rational prime. Then any prime 9 ;-ideal q satisfies exactly one of the properties 


(a) ex/r(q) =1= fr(q) and gx/r(q) =p, in which case q is completely split in K. 
(b) gr (4) =1=ex,r(q) and fx/r(q) =p, in which case q is inert in K. 
(c) fer (4) =1= 9x /r(q) and ex,/p(q) =p, in which case q is totally ramified in K. 


Proof. This is immediate from Theorem 5.4 on page 189 and Lemma 5.9 on page 233 . 


Remark 5.6 The case where pa € q in Corollary 5.20 deserves to be settled as well since 
it has fundamental consequences for the aforementioned proof by Kummer. The following 
observation will assist the reader with the next result. If a € q, then aDOr = q"J, where 
n € N and J is an O--ideal not divisible by q. In this case, we may assume without 
loss of generality that p {n. To see this, assume p | n, let y € q with 7? ¢ gq, and set 
a, = a(y—"/?)?. Then a root of «? — a, generates the same field extension K/F, since a 
root 6; of the latter equation satisfies 6? = a, = a(y—"/)?. Therefore, 6, € F(8), where 

” = q and conversely 3 € F'(61). Notice, as well, that once this translation is made, then 
the exact power of q dividing a, is equal to n — (n/p)p = 0, so gcd(aD p,q) = 1. 


Theorem 5.19 — Kummer p-Extensions 


Suppose that p is a rational prime and F is a number field containing a primitive p“” root 
of unity. Set K = F(8) where 8? = a € F, and a is not the p*” power of an element of 
Op. If pa € q, where q is a prime O-ideal, then one of the following occurs. 


(a) Ifa €q, then aDp = q”"I, where n EN, and J is an Op-ideal with q{ J. If p{n, 
then q ramifies in K, namely ex;r(q) = p, and fxsr(q) = 1 = gx /r(q)- 


(b) Ifa ¢q, but p € q, namely qnZ = (p), then Or(1—¢,) = q"J where J is an D p-ideal 
not divisible by q and n € N, and one of the following occurs. 


(i) The congruence 
x? =a (mod q”?*") (5.33) 


has a solution x € Op, in which case q is completely split in A. Conversely, if q 
is completely split in K, then congruence (5.33) has such a solution. 
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(ii) The congruence (5.33) has no solution in O-, but the congruence 
x? =a (mod q”"”) (5.34) 


has a solution x € Op, in which case q is inert in Kk. 
(iii) The congruence (5.34) has no solution in Oy, in which case q is totally ramified 
in Kk. 


Proof. We begin with an observation, the proof of which is similar to the demonstration 
given in Remark 5.6 on the preceding page. 


Claim 5.12 We may assume without loss of generality that 


q | ap, but q?7+aDp. 


Let y € q, and y ¢ q?. Since ged(p,n) = 1, there exist r,s € Z such that rp + sn = 1. Let 
ay, = a*®y"?. Then a root of x? — a, generates the same field extension K/F. To see this, 
we observe that if 


BP = ay = ay"? = B97? = (87), 
then 
By = Bey" CE € F(8), 


for some nonnegative integer /. Conversely, 
at — ast ynTP ans a(as—t)ynrP = a(aW™P)yrrP pany alan ty"). 
Therefore, 
a= at(ay"y”, 
so a € F'(,), as above. Hence, F'(G) = F(61) as asserted. 


From the choice of 7, the exact power of q dividing ajDp = aS Opry"?Op is qt"? = q. 
Hence, aD r = qI, where q{ I. This is Claim 5.12. 
Let 
Q = ged(qOx, BOx). 

Then 

OP = ged(q?Ox, ADK) = Ox. 
By Theorem 5.4 on page 189, Q is a prime ideal so q is totally ramified in K. This completes 
the proof of (a). 
To establish part (i) of (b), we first assume that q is completely split in K, so let 


qn = Q1---Qp, 


I 


where the Q; are distinct prime Ox-ideals. Thus, fx/r(Q;) = 1 = exr(Q;) for 7 
1,2,...,p. Therefore, O?N OF # q”—! for m EN, since if we have that or | (qoe) = 
q’ 1 OK, then ex/p(Q;) > 1, a contradiction. 


Claim 5.13 07 1 Op =q”™ for any m EN. 
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We use induction on m. If m = 1, then the result holds by Lemma 5.1 on page 182. Assume 
the induction hypothesis, that the result holds for m — 1. Then 

q” c oF AORrC oe AOp= qos, 
with OPN OFFA q”—!. Thus, oO”? 1 Or = q"™, which is Claim 5.13. 
By Claim 5.13, O-/q"?*? is a subring of De /ORP However, since q is completely split 


in K, then |Ox/Q; : Or/q| = fxsr(a) =1, 80 |Or/qr?*}| =|Dx/O"?™|, by Exercise 2.40 
on page 82. Therefore, there exists a y € Or such that 


B= (mod Q7?*"), 


namely 
ee | (y+ Boe 
Thus, 
NE/E (Q; prt? | NK/F (4 — B)Ox. 
However, 
Nx F(z — 8) = 2? — BP = 2? —a, 
so 


Gers gee) 


which means that 
x? =a (mod q”?*"), 


has a solution x = y € Or. 
Conversely, let (5.33) have a solution x = 7 € Op. Select u € q~” with u ¢ q~"*"+, so that 


Opuq” =I is an Op-ideal. We have that v = u(y — 8) is a root of (a — wy)? + uPa. 
Claim 5.14 (a — uy)? — uPa € Opa]. 


Since qr-) | Opr(1—¢,)?71 by hypothesis, then for all j € N such that j < p—1, we have 


n(p—1)—nj > 0, so 
a Ind € Op. 
J 


p-l 

(2 =a)? + 0a = (1) (2) whet — uP? — a), 
; J 
j=0 


Since 


and y? —a € q”*1, then u?(y? — a) € q since u € q~”. This completes Claim 5.14. 
By Claim 5.14, v € Ox, and so are the other roots, u(y—¢37') for 7 = 0,1,...,p—1. Set 


Q; = ged(qOx, u(y — G-*B)Ox). 
Then 9; 4 Ox forO< 7 Fk <p—1 since 
a | N/F (Dic) = u(y? — a). 


Also, 
qOK = Q1--- Qn, 
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since every element of a1 Q; is a sum of elements of the form 


=. 


(05 + 73uly — Gp *B)) = 7 + TUP(y? — a) € GOK, 
a 


Jj 


where oj,0 € qOx and 7;,7 € Ox. Thus, 9; Or = q for each such j and each Q; is 
distinct by Theorem 5.4 on page 189. This completes the proof of part (i) of (b). 
If (5.34) has a solution « = y € Op, then as in the proof of part (i), v = u(y — 8) € Ox 
with minimal polynomial 
mxje(v) = (e— uy)? — uP, 

and 

OK/F(v) = mig /p(v) = p(up)?", 
with ged(6x /7(v),2) = 1 for any prime Ox-ideal Q dividing q. Thus, by Theorem 5.9 on 
page 203 and Corollary 5.7 on page 210, q is unramified in Kk. By part (i), and Lemma 5.10 
on page 235, q must be inert in kK. This secures part (ii). 
For part (iii), assume that (5.34) on page 236 is unsolvable in Op, and let ¢ be the largest 
exponent such that x? = a(mod q’) is solvable in Or. By Exercise 4.31 on page 164, we 
must have that @€ N. 


Claim 5.15 pf é 


Suppose that y € O- such that for some natural number t < n — 1, we have a solution 
7 € Or to the congruence 
y? =a (mod q”). 
Suppose further that \ € Op such that A? = 0(mod q‘), but \? 4 0(mod q‘*!). Then for 
any w € Op, 
(y+ Aw)? = y? + Pw? (mod q’?*"). 


However, since w? ranges over all residue classes modulo q, we may chose w such that 
a= (y+ Aw)? (mod q‘?t"), 
a contradiction to the hypothesis. However, since € < np, then p{ &. This completes Claim 


5.15. 


By Claim 5.15, we may select natural numbers t,r <n—1 such that = tp+r. Let u € q7 
with u ¢ q-‘t, and set v = u(y — 8), which is a root of (2 — wy)? — uPa. By a similar 
argument to the above, v € Ox and q’ is the exact power of q dividing 


t 


Nxjr(v) = u(y? — a). 


Thus, 

gcd(qD x, v0 x) 
is an Ox-ideal distinct from Ox and qOx. Hence, qOx is not a prime Ox-ideal, and by 
part (i), qDx« is not completely split in kK. By Lemma 5.10, q must be totally ramified in 
K, 


A direct consequence of Corollary 5.20 and Theorem 5.19 on page 235 is the following 
important unramified extensions result. 
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Theorem 5.20 — Unramified Kummer Extensions 


Let F' be a number field, a,¢, € F such that a is not a p” power in F, p a rational prime 
such that K = F(¥/a), and ged(aDp,pDr) = 1. Then Dx/p = Op = (1) if and only if 
both of the following hold:°:8 


(a) aDOp =I” for some D;-ideal J, and 
(b) There exists a y € Op such that 


y? =a (mod (1—¢,)?OFr). 


Remark 5.7 The cyclic unramified extensions described by Theorem 5.20 play an impor- 
tant role in Kummer’s proof of Fermat’s Last Theorem for regular primes in the second 
case. In fact, this is a beginning of an introduction to an aspect class-field theory. We will 
not develop the tools to discuss this area in depth, but we will describe some fundamental 
aspects, since they pertain to our completion of Kummer’s aforementioned proof. 

Recall from Definition 5.14 on page 229, when K/F is an unramified Galois extension of 
number fields, then the Frobenius automorphism Ge is defined for any prime 9 x-ideal 
?. Thus, for any unramified Galois extension of number fields K/F, we may define the 
Artin map 


OK/F : Tnx b> Gal(K/F) 


via 
“ (K/F\% 
ox/r(1) = |] ( t ) ; 
ra j 
j=l 
where I € Ia, with I = jaa Py for distinct D-ideals P;.°-'4 The Artin map may be 


shown to be an epimorphism, so 
In fact, a fundamental result of class-field theory says more. 


Theorem 5.21 — Fundamental Theorem of Class-Field Theory 
If F is a number field and K“) is the maximal unramified >! abelian extension of F’, called 
the Hilbert Class Field, then 
Gal(K“ /F) = Co, 
via 
oxmyr Ta a) > Gal(K/F). 
Furthermore, since K“) contains every abelian unramified extension of F, then for a tower 


FCKCKY, 
[Are ee ale or) 


5-13Note that there is no ramification at the infinite primes for odd primes p since F is totally complex in 
that case. 

5-14The Artin map may be defined for more general extensions, which may be ramified, by excluding a. set 
of ramified primes, a necessarily finite set by Corollary 5.7 on page 210. However, for our purposes herein, 
we need only look at the special case of unramified extensions. Also, note that from Theorem 5.20, if K/F 
is an abelian unramified extension, then the Artin map depends only on the ideal class of a given ideal I. 
5.15In this context “unramified” also excludes those infinite primes that ramify (see Footnote 5.8 on 
page 213). 
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Proof. See [33]. 


An immediate and important consequence of this result from class-field theory is the fol- 
lowing, which the reader should compare with Exercise 3.33 on page 121. 


Corollary 5.21 A prime 9 p-ideal p splits completely in K“) if and only if p is a principal 
ideal. 


Remark 5.8 Corollary 5.21 tells us that the Hilbert class field K“) of F is characterized by 
the fact that the primes that split completely in K“ are precisely the principal prime 9 p- 
ideals. Note that in Theorem 5.21 on the preceding page, the association of the Galois group 
Gal(K/F) with Co, explains why K“ is called a class field. Moreover, the theorem 
shows that there is a one-to-one correspondence between unramified abelian extensions K of 
F and subgroups H of the class group Cp,. Furthermore, if the extension K/F' corresponds 
to the subgroup H C Cp, then the Artin map induces an isomorphism 


Co, /H © Gal(K/F). 


This last comment may be taken to be class-field theory for unramified abelian extensions. 
This illustrates the central theme of class-field theory, namely that the unramified extensions 
of a given number field F are classified in terms of the subgroups of the ideal class group 
Cp,- In other words, the class of unramified extensions are classified in terms of data 
intrinsic to F’. 


In the special case established in Theorem 5.20 on the previous page, we see that p | 
hp,- This is enough to prove a crucial result that will allow us to complete Kummer’s 
aforementioned proof. The reader is encouraged to solve Exercise 5.39 on page 243, which 
is related to the following lemma. Recall, as cited on page 151, that p is regular if p{ ho, 
where F = Q(G,). 


Lemma 5.11 — Kummer’s Lemma 

Let p be a regular prime, and let F = Q(¢,). If u € Lo, such that 
u=z (mod pO-p) 

for some z € Z, then u = v? where v € Up,. 


Proof. Let K = F(/u), where ¥/u is a real root of x? — u. If u is not the p*’-power of an 
element of Up,, then K/F is a nontrivial Kummer extension. Since 


POF im (1 _ (a ae 


by Example 5.8 on page 190, then by Exercise 4.32 on page 164 the hypothesis of Theorem 
5.20 is satisfied, namely K/F is an unramified extension. Therefore, by Theorem 5.21, 
p|ho,, a contradiction to the regularity of p. 5.16 


Theorem 5.22 — Kummer’s Proof of FLT Case II for Regular Primes 


If p is an odd regular prime, then (4.14) on page 149 has no solutions in rational integers 
x,y,z with p | ryz. 


5.16Without the use of Theorem 5.21, the proof of Kummer’s lemma is long, and relatively difficult by 
comparison since it involves Kummer’s use of p-adic numbers. For instance, see [5, pp. 367-377]. By 
employing the elegant Theorem 5.21, even without proving it, we get an insight into the power of class-field 
theory, and it allows us to complete Kummer’s proof of FLT for regular primes with less difficulty. 
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Proof. Clearly, we may assume without loss of generality that gcd(x,y,z) = 1, so p | Zz 
and p {xy may also be assumed without loss of generality. Set z = p*z, with k € N, and 
gced(z1,p) =1. If F = Q(G,), then by Example 5.8 


pOr = (1 _ Cp)? +4, 
where u € Up,. Thus, (4.14) becomes 
a? + y? + uPk(1 — Gp)??? = 0, (5.35) 


where n = k(p—1) € N. To prove the theorem, it will suffice (a fortiori) to prove that 
(5.35) cannot hold when z, y, 21 € Or with x,y, z relatively prime to 1 — q. 
We use proof by contradiction. Assume that (5.35) is solvable for some such a, y, z1 € Or, 
and let n € N be the smallest value for which it holds. Rewriting (5.35) as an ideal equation 
we get, 

p-1 

[[@+ Gy) = pers”, (5.36) 

j=0 
where p is the prime --ideal (1 — ¢,), and J is an Op-ideal. Although long, the proof 
amounts to essentially a descent argument where we contradict the minimality of n by 
showing that (5.36) holds for n — 1. 
Since n € N, then for 7 > 0 

p | (a + Gy). 
However, 
at Gy = a+ Cy — a - *)y. 


Therefore, since p | (1 — ¢-*), then p | (a + C/y) for all nonnegative j < p—1. Also, we 
p P 
cannot have that 
o+¢hy=a+ Cy (mod 7), 


for 7 # k, since in that case we get 
Gy(L—G*) =0 (mod p’), 


which cannot hold since gcd(Chy, p) = 1, given that p{ y, and by Exercise 3.37 on page 129, 
1—(J-* and 1—G are associates. Hence, «+¢/y are pairwise incongruent modulo p?. Thus, 
(z+ Gy)(1 — Gp)? are pairwise incongruent modulo p for 0 < j < p—1. By Exercise 4.25 
on page 163, these values provide a complete residue system modulo p. Therefore, for some 
nonnegative 7 < p—1, 


(a + Gy)(1— Gp)" =0 (mod p). 
Thus, for only this value 7 do we have 
x+ Gly =0 (mod p’). 


Since we may replace y by (hy for any nonnegative k < p— 1 in (5.35), we may assume at 
this stage, without loss of generality, that we have already chosen 


x +y=0 (mod p?) and «+ Gy =0 (mod p), with p { (x + Gy) forl<j<p-1, 


so the left side of (5.36) is divisible by at least p?~'p? = p?+!. This implies that n > 2. 
Our assumption is that gcd(x,y,p) = 1, so p { gcd(z, y) = g, the gcd of the two O--ideals 
(x) and (y). Therefore, 
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(x + Gy) = pal;, 
where I; is an Op-ideal for 0 < j < p—1, and 
(w+ y) = pre" +1 g Ip. 
Claim 5.16 gcd(J;,f;,) =1forO0<jAk<p-l. 


Let q | gcd(I;,J,) for a prime DO p-ideal q with j 4 k. Thus, if 


paq | gcd(x + Cy, x + CPy), 
then 
paq | gcd(x(1 — CF), 2y(1 — GB”). 
Thus, 
gq | gcd(a,y), 
contradicting the definition of g. This completes the proof of Claim 5.16. 
By Claim 5.16, we may write (5.36) as 


p-l 
op [] i =p, 
j=0 
where I; = IR for some 9 p-ideal J; | J with 0 <j <p-—1. Hence, 
(a + y) = prer—D+1g 7? (5.37) 


and ; 
(c+ Gy) =pgJ? for l <j <p-1. (5.38) 


From (5.37), we get 
(e+ yp rN TE? = g, 


Substituting this into (5.38), we get 
(x + Gy)prer—D = (@ + y)(Tpdg*)?. (5.39) 
Since p = (1—¢,) is a principal prime 9 p-ideal, then (J; as is principal. By invoking the 


regularity of p and using Exercise 4.11 on page 147, we must have that J; Jo ' is principal. 
Therefore, for 1 < 7 < p—1, we may set 


Ig Ig? = (04/83), 


where a;,6; € Or. Since gcd(J;,p) = 1 = gced(Jo,p), we may assume that gcd(a;,p) = 
1 = ged(G;,p). Thus, from (5.38)—(5.39), 


(@ + Gy) — GP? = (@ + y)(ay/8;)?u;, (5.40) 
where uj € Mo,. Since (a + Gpy)(1+ Gp) — («© + Cy) = G(x +y), we may multiply this by 
(1—¢,)P?-») and use (5.40) with j = 1,2 to get, 


@+u)(S) ma+g)-@+H(B) w= lt nGa—Ger. 
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Multiplying through by (61 62)?/[ui(x + y)(1 + ¢)], we get, 


gg yp pln) 
(12)? uy(1 + G)' 2}? = ui (1 z Cp) ee (BiB)? 


By letting a = a) $2 € Op, v = —ug/[ui (14+ ¢)] € Uo,, B = a261 € Or, y = F162, and 
v1 = ¢p/[ui(1 + ¢,)] € Uo, we achieve, 


a? + uBP = vy (1 — G)Pe-Yp. (5.41) 


We now proceed to show that this contradicts the minimality of n, which will complete the 
proof. 


Above we showed that n > 2, so p(n — 1) > p. Therefore, 
a? + v8? =0 (mod p?). (5.42) 
Since p { 3 by assumption, then 6 has a multiplicative inverse 6; modulo p?, namely 66) = 1 
(mod p?). Multiplying through (5.42) by 6? and rewriting, we get, 
v =(—£1a)? (mod p?). 
From Exercises 4.31—4.32 on page 164, 
—B,a =z (mod p), 


where z € Z, so 
(—B,a)? = z? (mod p?). 
In other words, 
v =z” (mod p?). 
By Lemma 5.11 on page 240, there exists a w € Up, such that v = w?. Hence, via the 
above congruence, (5.41) becomes 


a? + (WB)? = v1(1 = GPO 9, 


which contradicts the minimality of n, and establishes the full result proved by Kummer. 


This concludes this section, and in conjunction with previous sections, establishes a num- 
ber of powerful results that will allow us to establish the fundamental theorem of abelian 
extensions, the Kronecker-Weber Theorem in §5.6. 


Exercises 


5.37. Let K,;/F for 7 = 1,2 be extensions of number fields, and let p be a prime 0 7-ideal. 
Prove that if p is unramified in K,; for 7 = 1,2, then p is unramified in kK, K2. In 
particular, show that if p is completely split in A; for 7 = 1,2, then p is completely 
split in Ky Ko. 


5.38. Let F'/Q be an abelian extension of number fields. In the next section, the Kronecker- 
Weber Theorem will verify that 
F C Q(¢f) for some f EN. 
The smallest such f is called the conductor of F. Prove that if the conductor is odd 


and squarefree, then F'/Q is tamely ramified. 


5.39. Let p > 2 be prime F = Q(¢,), and A = 1— ¢,. Prove that for any y € Op, there 
exists a z € Z such that 
yP =z (mod A?). 
Conclude that 
+? =z (mod p). 
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5.6 The Kronecker-Weber Theorem 


All the people we used to know. 
They’re an illusion to me now. 
Some are mathematicians. 
Some are carpenter’s wives. 


From Tangled Up in Blue (1974) 
Bob Dylan (1941-) 
American singer and songwriter 


This section is devoted to a proof of the Fundamental Theorem of Abelian Extensions, also 
known as the following.>!” 


Theorem 5.23 — The Kronecker—Weber Theorem 


If F is a number field, which is an abelian extension of Q, there exists a natural number n 
such that F C Q(¢,). Moreover, n can be chosen in such a way that n andA - have the 
same prime factors. 


We establish Theorem 5.23 via a sequence of lemmas. We begin by showing that it suffices 
to restrict our attention to the case of prime-power degree. 


Lemma 5.12 If Theorem 5.23 holds for abelian extensions of prime power degree over Q, 
then it holds for any abelian extension of Q. 


Proof. First we show that every number field F' abelian over Q is a compositum of abelian 
extensions of prime power degree over Q. By Theorem A.1 on page 321, 


Gal(F/Q) = [| G;, 
j=l 
where G, is an abelian group of order |G;| = p;’ for distinct primes p;, a; € N, and 


|F : Q| = |Gal(F/Q)| = [[ »%?. 


j=l 
Let F; for i = 1,2,...,r be the fixed field of Hj4i G;, the product ranging over all j 4 ¢ for 


1<j<r. Thus, |F; : Q| = |Gal(F/Q)/T])4; Gi] = |Gil = pj", by Theorem 2.4 on page 60. 
Therefore, by Exercise 3.36 on page 129, the compositum has degree 


Had = |[ > =|F:Q|. 
i=l jel 


Since [[;_, Fi C F, then F =J]j_, F. 


5-171f we had developed the full force of class-field theory herein, then one could “easily” prove this funda- 
mental theorem. For instance see [15, Theorem 8.8, p. 163]. However, even therein, where the main results 
of class-field theory are stated but not proved, it is admitted that “the general theorems of class-field theory 
are complicated to state.” Thus, there is some price to pay in attaining the result no matter what the route 
happens to be since it is a relatively difficult theorem from any perspective. 
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Now assuming that Theorem 5.23 holds for all such F;, then F; C Q(¢,,) for some n; € N. 
Let € = lem(nj,72,...,n,). Then 


F= [4 Ss Q(Giys'Gnayets Gaz) Cc Q(¢e), 


i=l 


and the result is proved in view of Theorem 5.13 on page 215. 


The next lemma is a Galois-theoretic result required for the subsequent lemma. 


Lemma 5.13 Let K,/F be Galois extensions of number fields for 7 = 1,2. Then each of 
the following holds. 


(a) K,K2/Ko is a Galois extension and 


Gal(K1 K2/K2) — Gal( ki /Ky NM Ko). 


(b) The extension Ky K2/K Ko is Galois, and we have the isomorphism of Galois groups, 
Gal(K, Ko/Kk, N Ko) = Gal( Kk, /k, N Ko) x Gal(K2/K, N Ko). In particular, if 
Kin Ke =P, then 


Gal(K,K2/F) & Gal(K,/F) x Gal(K/F). 


c) If K;/F for 7 = 1,2 are abelian extensions of number fields, then K,K2/F is also 
j J 
abelian. 


Proof. (a) By Exercise 2.6 on page 63 there exist |Ay Ke : K2| embeddings of KK. into C 
that fix K pointwise. If o is such an extension, then 


a(K, Ko) = a(K1)o(K2) = a(K1) Ko © Ky Ko. 
Hence, K,K2/Ko is Galois. Consider the mapping 
w : Gal(K, K2/K2) b> Gal( Kk, /k, N Ko), 


given by ¢ +> o|x,, the restriction to Ki. By Exercise 2.6 this is an epimorphism. It 
remains to show that ker(w) = 1. If Y(o) = 1, then o fixes Ky, pointwise, but o already 
fixes Ky pointwise by definition, so o fixes K,K2 pointwise. In other words, 0 = 1, so 
ker(¢) = 1, and 

Gal( Kk, K2/K2) — Gal( ki /Ky NM Ko). 


(b) By the same reasoning as in the proof of part (a), Ay Ko/K1M Ke is Galois. Also, by 
Theorem 2.4 on page 60, K;/K 1 Ke is Galois for 7 = 1,2. Consider the mapping 


p: Gal( Ky Ko/Kky NM Ko) aed Gal(k,/ki M1 Ko) x Gal(Ko/Ky1 NM Ko) 
given by 
prarr (o|K,,0|K2); 


the restrictions to K, and Ko respectively, which is an epimorphism by Exercise 2.6. We 
need to verify that ker(p) = 1. If p(a) = (1,1), then o fixes both Ky andK2 pointwise, so 
o = 1, as required. 
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(c) By part (b), 
Gal( Kk, Ko/Ky M Ko) J Gal( ki /Ky M Ko) x Gal(K2/Ky M Ko) = Gy x Go. 


Since G; C Gal(K;/F) for 7 = 1,2, both of which are abelian, then G, is abelian for 
j = 1,2, so Gal(K) K2/K1 Ke) is abelian. However, by Exercise 2.6, kK, K2/F is Galois, 
and the F-automorphisms of kK, K2 extend to |KiK2 : Kyi M K2| embeddings of Kk, K2 
into C, so Gal(K, K2/F) is abelian. 


Lemma 5.14 If F'/Q is an abelian extension with |F’ : Q| andA r both being powers of a 
prime p, then F’ C Q(¢k) for some k € N. 


Proof. We split the proof into the odd and even cases. 
Case 5.4 p>2 


Let 

K = Q(¢pm+1), where |F : Q| = p™. 
By Application 5.1 on page 229, Gal(K/Q) is cyclic of order ¢(p™*1). Let H be a subgroup 
of it of order p— 1, and let L be the fixed field of H. 


Claim 5.17 |FL : Q| is a power of p. 


Since |H| = p—1, then L/Q is a cyclic extension withA , a power of p given thatA , | Ax. 
Thus, F'L is an abelian extension of Q by part (c) of Lemma 5.13 on the preceding page. 
Also, 

IFL:Q)=|FL:L|-|L:Q=|F: FOL|-|L:Q, 


which is a power of p, where the last equality comes from part (a) of Lemma 5.13. 


Claim 5.18 A ry is a power of p. 


Suppose that q | Arr. Then by Exercise 5.37 on page 243, either qg is ramified in L/Q or q 
is ramified in F'/Q. Therefore, either q | A, or q | Ar. However,A ; | AQ(Ejm41)? which is 
a power of p, andA - is a power of p by hypothesis, so q = p. This establishes Claim 5.18. 
In view of Claims 5.17-5.18, we may invoke Exercise 5.41 on page 253 to get that Gal(F'L/Q) 
is cyclic of prime power order. Since, by part (b) of Lemma 5.13, 


Gal(FL/LN F) © Gal(F/L 0 F) x Gal(L/LN F), 


then by Exercise 5.40, either Gal(F/IZN F’) = 1 or Gal(L/LN F) = 1. If the former occurs, 
then F = LF, so F C LF, and in the latter case, D = LMF, so L C F. However, 
|F : Q| = |L: Q|, so F = L, which implies that F C Q(G,m+1), thereby establishing Case 
5.4. 


Case 5.5 p= 2 


Claim 5.19 For any m €N, there exists a totally real field K such that |K : Q| = 2™ with 
AK = 2", and K C Q(¢gm+2) for some n € N. 


Let L = Q(Cgm+2) and set K = LOR. Since m+2 > 3, then /—1 =7 € L, so for atbi € L, 
we must have 2a,2b ¢ K. Therefore, a,b © K and L = K(i). Hence, |L : K| = 2, so 
|k : Q| = 2™. Ifq | Ax for a prime q, then q ramifies in K, so q ramifies in L. Thus, 
q | Ax, which is a power of 2, so gq = 2. Thus,A x = 2” for some n € N. This completes 
Claim 5.19. 
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Claim 5.20 For a given m EN, the field K in Claim 5.19 is unique. 
K is the maximal real subfield of Q(¢gm+2). If Ay # K is another such field, then 
|KK,:Q|>2™??. 


Therefore, KK, = Q(gm+2), contradicting the fact that KK, is real. This establishes 
Claim 5.20. 

Since F' and Q() are abelian extensions of Q, then F(i) is an abelian extension with degree 
a power of 2 over Q, by part (c) of Lemma 5.13. Let 


K=F(Q)OR. 
Then K is a real extension of Q, 
|i : Q| = 2° 


for some s € N, andA x is also a power of 2. By Claims 5.19-5.20, K C Q(¢g:+2). Since 
F(t) = K(a+ bi) for some a,b € R, then given that a— bi € F(z), we must have a € K and 
bi € F(i). Thus, b? € K, so a+ bi is a root of 


a? —2ar+a* +0? € K[z]. 


Hence, |F'(z) : F| = 2. Therefore, 


F € F(i) = K(t) © Q(Ggs42, 4) € Q(G,), 


for some r € N, which establishes the full result. 


Before proceeding, we need the following important concepts, which are related to Defini- 
tion 5.13 on page 224. 


Definition 5.16 — Ramification Groups and Ramification Fields 


Let K/F be a Galois extension of number fields, and let P be a prime Ox-ideal. For each 
nonnegative integer 7 define: 


V; ={0 €Tp(K/F):0° =a (mod P!t") for all a € Ox}, 


called the j“” ramification group of P in K/F. The fixed field Vi? (K/F) is called the j*” 
ramification field.5!8 (Note that Tp(K/F) = Vo and V\°)(K/F) = Tp(K/F).) 


We now establish some properties of the concepts in Definition 5.16 since they are needed 
in the sequel. 


Lemma 5.15 
(a) Vj; is a normal subgroup of Jp(K/F). 
(b) Je(K/F) = Vo DVi 2D--. 


(c) There exists an m € N such that V,, = 1. 


5.18The letter V is used for the ramification fields given that the derivation is from the German Verzwei- 
gungskorper—see Footnote 5.9 on page 224. The ramification groups were first defined by Hilbert in 
1894—-see Biography 3.4 on page 94. 
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(d) If A} denotes the multiplicative group of nonzero elements of the field Kp = Ox/P?, 
there exists a natural group isomorphism of Jp(K/F)/Vj into K%. 


(e) Tp(K/F)/V is a cyclic group with order not divisible by p where p = P/N Z. 
(f) For j > 0, the groups V,;_1/V; are elementary abelian p-groups. In other words, they 


are finite dimensional vector spaces over F,. 


g) Vj, is a p-group, and T.) (K/F) is the maximal tamely ramified extension at P con- 
P 
tained in Kk. 


(h) Suppose that Dp(K/F) is abelian, and set g = |F,| = N?/2(p), where Fy = Op/p. 
Then 72-1 € Vj for all tr € Tp(K/F) and 


\T>(K/F)/Vi| | (q-1). 


Proof. (a) If o € Tp(K/F), then P? = P, so (Pt!) = PIt!. Thus, o has the action 


o(@) = o(a), 
where the % denotes the image of z € Ox in Ox /PI*! under the natural mapping 
W: ORK OK/PIT. 


Therefore, o € V; if and only if o is the identity mapping on Dx /P/t". Since ker(y) = Vj, 

then by Theorem A.5 on page 328, V,; is a normal subgroup of Tp(A/F). 

(b) We have that Vj;41 C V; for 7 = 0,1,... since a” = a(mod PJ*?) implies that a7 = a 

(mod J++), 

(c) Ifo € N9V;, then o(a) — a € NZ P!*". Therefore, o(a) = a for all a € Ox. Hence, 

NFL0V; = 1. However, Tp(K/F) is a finite group, so there must exist an m € N such that 
os 

(d) Let Ox /P = Ko. 

Claim 5.21 For a fixed y € P — P?, and any o € Tp(K/F), there exists ag € Ox such 

that for 


Oo 


77 =agy (mod P”), 
where a, is uniquely determined modulo P. 


Let yOu = PI, where Pt J. Then by Theorem 1.21 on page 32, there exists a solution to 
the system of congruences 
g=7" (mod P*), 


x =0 (mod J). 


Let ag = xy~!. Then a, is uniquely determined modulo P and 
Ay =xL£=7" (mod F?). 
This completes the proof of Claim 5.21. 


Claim 5.22 For any 0,7 € Tp(K/F), ag, = a0, (mod P). 
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We have 


a ee 


dor) =" = (1) = (a0) = 0577 = aa,7 (mod F). 


Since y € P — P?, then by multiplying through the congruence 
Qory = ata,y (mod P?) 
by y~! we get, 
Agr =aza, (mod P). 


However, a7 = a, (mod P) for all T € Tp(K/F). This yields Claim 5.22. 


Define a map: 
p:Tp(K/F) 4 K5, 


by 
piTgr Ag. 


By Claims 5.21—5.22, p is a well-defined homomorphism of groups. Since a, = 1 if and only 
if y7 = y(mod ??), by Claim 5.21, then a, = 1 holds if and only if o € V1, so V1 = ker(p). 
This completes the proof of (d). 


(e) If e; = limg(p)|, then p { e; since 
e1 | |Kp| =p°—1 


for some e € N. Also, since img(p) is a subgroup of K}, then by Theorem A.8 on page 331, 
Tp(K/F)/V, is a cyclic group, and by the above has order prime to p. This is (e). 


(f) This part proceeds in much the same fashion as the solution to (d), except that we work 
on the additive group Kf of Kp. Let y € P — P? be fixed. Then 7/ € PJ — PI+! for any 
JEN. 


Claim 5.23 For any o € Vj-_1, there exists a, € Ox such that 
y= y+ag7) (mod P+), 


Set yOx = PII where P + J. By the Chinese Remainder Theorem for ideals cited above, 
there exists a solution to the congruences 


r=y° (mod P3*1), and z=0 (mod J). 
Select ag = (x — y)y~4. Then 
Ay) =£-y= 7° —7 (mod P+"), 


Thus, ; 
y = y+tac7 (mod perry, 


which is Claim 5.23. 
Claim 5.24 For all 0,7 € Vj_1, agr = A +a; (mod P). 


We have that 


ty Se SSO Sr Ot ery Sy tent Sy 


=yta,y" tary -y=V(artaty-Y) (mod P+), 
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Thus, multiplying through by y~/ we get 
Agr =a, taty7-Y) (mod P). 


Since at = a, (mod P) for all 7 € Vj_1 C Vo and 77-) = 1(mod P) given that 77 = 7 
(mod ?), then Claim 5.24 follows. 


Define a map 
pia: Vi-1 > Ke, 


by 


Ot? dz, 


which is a well-defined additive group homomorphism independent of the choice of a by 
Claims 5.23-5.24, and ker(pi) = Vj. Hence, V;~1/V; is a direct sum of cyclic groups of 
order p, since Ke is such a sum, so Vj_1/V,; is an elementary abelian p-group, thereby 
securing (f). 
(g) By parts (b)—(c) above, 

Vo 2 V1 2-++ 2D Vm = 


for some m € N. Also, Vo/V; is a cyclic group, and V;_1/V; os elementary abelian 
p-group by parts (e)—(f) just proved, so V, is a p-group. Hence, To UK /F) is the maximal 
tamely ramified extension at P contained in K, which is (g). 

(h) Let o € Dp(K/F) be the element such that its image in Gal(p/F,) is the Frobenius 
automorphism. Then for each r € Tp(K/F'), we have from Claim 5.21 in the proof of part 
(d) that 


-1 


977? = (YOig-1)"? = (7 a1)? = (arya -1)” 
= aey7art, =alry’at_, =ala,yaz_1 (mod py, 


We have shown that - 
y TT =a2%a¢a%-1y (mod P). (5.43) 


Claim 5.25 a,a%_, = 1(mod ”). 


We have 
77 =agy (mod 7), (5.44) 


and 
-1 


7° =a,-17 (mod P?). (5.45) 
Putting together (5.44)—(5.45), we get 


-1 -1 -1 -1 


(Qa) Ie= A Pasay b= O0Ay =774" “Ht (mod 2"), 


as required to complete Claim 5.25. 


By Claim 5.25 and (5.43) and the fact that o is the element such that its image in 
Gal(K'/F,) is the Frobenius automorphism, 


were = =aly=a%y (mod P”). (5.46) 


T 


However, 77 =a,7y(mod ??), 77 =a? (mod 7), and so on. Thus, by induction 


47" =a‘%y (mod 7). (5.47) 
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Combining (5.46)-(5.47), we get y= 77" (mod 2). Thus, 777 "7 * = y(mod 2). 
We have shown that ora~'r~4 € V1. When D-(K/F) is abelian, then 
(ara 37-4) = 7h 1 € Vi, 


for all rT € Tp(K/F). Since Tp(K/F)/V, is cyclic and r?-1V, = Vi, then 


\Tx(K/F)/Vi| | (q-), 


which is (h) so we are done. 


Lemma 5.16 Let F' be an abelian number field over Q with |F : Q| =n. Then for every 
prime p | Ar, with p{ n, there exists an abelian number field K over Q such that |K : Q| | n, 
FC K(G), and p{ Ax. Furthermore, any prime divisor ofA x is a prime divisor ofA p. 


Proof. We break this into two cases. 

Case 5.6 p | Ar, p{n, and ¢ € F. 

Since p{n, then by Theorem 5.4 on page 189, p { er/g(p). By part (g) of Lemma 5.15 on 

page 247, |F : Ve (F/Q)| is a power of p, but |F : Ve (F/Q)| | n, so F = VE? (F/Q), 

where p is a prime O;-ideal over p. By part (h) of Lemma 5.15, |Tp(£'/Q)/V1| | (p — 1), 

but since F = Vi? (F/Q), then |Tp(F'/Q)| | (p — 1). However, by Theorem 5.1 on page 184, 
er/alP) = €r/accy) (Pearcy) al? O ZlG))- 

Since |Q(¢p) : Q| = p— 1, by Corollary 1.17 on page 41, then ep/qc,)(p) = 1. 

Claim 5.26 Kk =T,(F/Q) satisfies the conditions of the lemma. 

Since Gal(F'/Q) is abelian, then K’/Q is an abelian extension with |K : Q| | n. Also, 


Tp(K/Q(6p)) = Tp(F/Q) N Gal(K/Q(Gp)). 


By a similar argument to the above, the first ramification field of p in F/Q(G,) is 


ViO(F/Q(ép)) = F. 


Therefore, |F' : K(¢,)| = ep(F/Q(G)) = 1, so F = K(G,). Since K = T,(A/Q), then 
pt Ax by Corollary 5.8 on page 210. Furthermore, if g 4 p is a prime with q | Ax, then q 
ramifies in K/Q, and so must ramify in F/Q. Hence, q¢ | Apr, which completes Claim 5.26, 
and so Case 5.6. 


Case 5.7 p| Ar, ptn, and ¢, ¢ F. 


Let L = FM Q(G,). Then by part (b) of Lemma 5.13 on page 245, Gal(F(¢,)/L) = 
Gal(£/L) x Gal(Q(¢,)/L). Thus, 


IF (Gp): L]- |L: Q| = |F(Gp) : Q| = |F : L]- |Q(G) : LI - |Z: Ql, 
and this last value equals both 
|F : L| -|Q(¢p) : Ql =|F : L|- (p— 1), (5.48) 


and 


IF: Q|- |Q(Gp) : Ll = n- |Q(G) : LI. (5.49) 
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From (5.48)—(5.49), 

IF (Gp): Q)=|F: L]-(— 1), (5.50) 
and 

lF(¢,) : Q| =n- |Q(¢,) : LI. (5.51) 
Thus, by multiplying (5.50)-(5.51), we get |F(¢,) : Q)? = |F: L|- (p—1)-n- |Q(G) : LI. 
Therefore, since part (a) of Lemma 5.13 tells us that |F(¢,) : F'| = |Q(¢,) : Ll, we have 
|F(Cp) : Q|- |Z: Q? = (p—1)-n. Hence, 


|F (Gp) : Q| | n- (p— 1). (5.52) 


Since p ramifies in F’, then p ramifies in F'(¢,)/Q. Therefore, p | AFrce,): From (5.52), this 
yields that p { |F(¢p) : Q|. Let P be a prime Dp,¢,)-ideal over p. Now we apply Case 5.6 to 
F (<p). Let K = To(F(p)/Q) C F (Gp). Then |F'(6p) : K| = e9(F(p)/Q) = p— 1. Also, 


IFS) : QO = |FG): K]- |: Q) = (w— 1) -|K : Q. 


Thus, by (5.52), |K : Q| | n. Since p is unramified in K’/Q, then pt Ax. Also, ifq#pisa 
prime such that q | Ax, then q ramifies in K/Q, so also in F(¢,)/Q. By Exercise 5.37 on 
page 243, q must be ramified in F'/Q or in Q(¢,)/Q. Since g # p, then by Corollary 3.9 on 
page 125, q ramifies in F'/Q, so q | Ar, and we have completed the entire proof. 


Lastly, we have the following concluding lemma. 


Lemma 5.17 If Theorem 5.23 holds for abelian number fields whose degree and discrimi- 
nant are a power of a given prime p, then it holds for arbitrary abelian extensions of degree 
a power of p. 


Proof. Let |F : Q| = p™. If gq # pis a prime dividingA -, then by Lemma 5.16, there 
exists an abelian extension A’/Q such that F C K(G), |K : Q| | |F : Q|, at Ax, and if 
r is a prime dividingA x, then r|Ar. Hence,A x has fewer distinct prime divisors than 
Ar. Suppose thatA x is not a power of p. Then we repeat the above argument on 
and get another field Ay withA x, having fewer distinct prime factors thanA x, while ky 
satisfies the properties of Lemma 5.16. Since there exist only finitely many such primes 
by Corollary 5.7 on page 210, we terminate this process after a finite number, r + 1, of 
iterations. Therefore, for integers s = 0,1,...,7, we have abelian extensions K,/Q such 
that |K, : Q| is a power of p, and K,, C K(¢s,), for some s; € N, and K, C Q(¢,) for 
some r € N, with the last containment coming from Lemma 5.14 on page 246. Hence, 


FC K(¢s,), KC Ki(63,), iy C Ko(Gs,),---; Kr © Q(Cs.,.). 


Therefore, F C Q(¢s9,¢s1;-++5¢s,) © Q(Gn), where n is the lem of the orders of the ¢,, for 
FSU Ay eeey hs 


Theorem 5.23 is now an immediate consequence of Lemmas 5.12-5.17. The proof of the 
Kronecker-Weber Theorem places us at the doorstep of class-field theory, at which we 
have already had a peek via Theorem 5.21 on page 239. The celebrated Kronecker-Weber 
Theorem was first stated by Kronecker in 1856, and first proved by H. Weber in 1886—see 
Biographies 4.9 on page 164 and 5.4 on page 254. Numerous proofs have been given since 
then. Among them are one given by Hilbert in 1896, one by F. Mertens in 1906, and 
another by Weber himself in 1907. A proof was given by the late Hans Zassenhaus in 
1969. More recently a proof was given by Greenberg in 1974—see [23]—[24]. Although the 
proof of the latter is deemed to be “elementary,” once all the facts cited therein are proved, 
the proof turns out to be longer than the once presented here and essentially the same 
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sequence of lemmas is employed, so the reader is now provided with a relatively complete 
and straightforward introduction to the theorem. 


Exercises 


5.40. 


5.41. 


5.42. 


5.43. 


5.44. 


5.45. 


5.46. 


5.47. 


5.48. 


Let G be a cyclic group of order p” where n € N and p is prime. Prove that if 
G = G X G2 where G; are cyclic groups of order p™ for 7 = 1,2, then either m; = 0 
or mz = 0. 


Prove that any number field F' abelian over Q with both degree over Q and discriminant 
a power of an odd prime must be a cyclic extension of Q. 


Let G be a finite abelian p-group, where p is prime, and let |G| = p™, form € N. 
Establish the following two facts. 


(a) For any subgroup H of G of order p” with n € N, there exists a subgroup of G 
of order p” forn <r <m containing H. 


(b) If G has only one subgroup of order p™~!, then G is cyclic. 


Let K/F bea Galois extension of number fields, and let P be a prime D x-ideal. Prove 
that all ramification groups V,; for 7 > 0 are normal subgroups of Dp(K/F). 


(Hint: See Lemma 5.15 on page 247.) 


Let K/F be a Galois extension of number fields, and let P1,P2,...,Pn be the prime 
Ox-ideals ramified in K/F (possibly the empty set—see Remark 5.8 on page 240). 
Suppose that H is the subgroup of Gal(K’/F’) generated by the inertia groups Tp, (K/F’) 
for 7 = 1,2,...,n, and let L be the fixed field of H. Prove that L is the maximal 
subfield of K that is unramified over F. In particular, conclude that if F = Q, 
then Gal(K/F) = H is generated by the inertia groups. (This result is called the 
Monodromy Theorem for algebraic number fields.) 


Suppose that K/F is a Galois extension of number fields with P a prime 0 x-ideal. 
Let V; for 7 = 0,1,2,...,m—1 be all of the nontrivial ramification groups of K/F 
with different Dp. Prove that if P* | Dx p, but P**! { Dx/p, then 


= Lmi-v 


(This equation is called Hilbert’s formula.) 


Let K/F be a Galois extension of number fields of degree n, and let P be a prime 
Ox-ideal with e = ex/p(P). Prove that P is tamely ramified in A/F if and only if 
P° + Dp. Conclude that P is tamely ramified in K’/F if and only if p” { Ax; where 
p= ?NOFf. (Note that this establishes the promised converse of the result discussed 
in Remark 5.3 on page 213.) 


Let K/F be an extension of number fields. Prove that Tk;r(Ox«) = Or if and only 
if there is no prime O-ideal p that divides Dx p. 


Suppose that G is a multiplicative group of order n and R is a ring. Let R[G] denote 
the additive abelian group 


Ss B+ th. 


GEC n re 
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5.49. 


5.90. 
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Thus, R[G] consists of the formal sums er 9,9; for rg, € R and g; € G, with the 
sum ranging over all of the n elements g; of G. Addition is defined by 


n n 


n 
ys 19595 + » Sh3 95 = a + 8g; )9i, 


j=1 j=l j=1 


with possibly some zero coefficients to ensure that any two of these formal sums range 
over the same indices g; for 7 = 1,2,...,n. Also, multiplication is defined by 


(>. roa] She =e ae Gis): 


i=1 j=l i=1 j=1 


Then with these operations R[G] is called the group ring of G over R. Let K/F 
be a Galois extension of number fields with G = Gal(K/F) and R = DOr. Suppose 
further that R[G] = Ox. Prove that Tk;p(Oxn.) = R. In particular, when F = Q, 
we think of Dx and K as Z[G]-modules by the action ()),799)t = di4799(x) for 
xz € K. Use this to conclude that a Galois extension K/Q has a normal integral basis, 
namely a basis consisting of conjugates of a single integer, if and only if Z[G] = Ox 
as Z|G]-modules—see Remark 2.3 on page 79. 


Let K/F be a Galois extension of number fields such that Or[G] = Ox~ where G = 
Gal(K/F') (see Exercise 5.48). Prove that there does not exist any prime 0 f-ideal 
p such that p” | Ax/r where n = |G|. Conclude that if 0x = Op[G] as an D-[G]- 
module, then K/F is tamely ramified. (This result was first proved by A. Speiser in 
1916.) 


Biography 5.3 Andreas Speiser (1885-1970) was born on October 6, 1885. 
He studied at Gottingen from 1904 to 1909 as a student of Minkowski. His 
dissertation was on binary quadratic forms over general algebraic number 
fields. He wrote a book on group theory entitled Die Theorie der Gruppen 
von endlicher Ordnung, which was published in 1923. Several new editions 
came out, with the last one in 1980. He is also known for his editing of several 
collected works including, and especially, that of Euler. He died on December 
10, 1970. 


With reference to Exercises 5.48-5.49, prove that a quadratic extension K of Q with 
Ax/g even cannot have a normal integral basis. 


Biography 5.4 Heinrich Martin Weber (1842-1913) was born on May 5, 
1842 in Heidelberg, Germany. He was a student of Dedekind, and worked 
principally in algebra and number theory. His best-known work is his three- 
volume Lehrbuch der Algebra, which was published in 1895. This text became 
a standard, and influenced an entire generation of mathematicians to bring 
group theory into the twentieth century as a major branch of mathematics in 
its own right. Weber’s proof of Theorem 5.23 on page 244 is known to have 
gaps (see the introduction to [27]). He died on May 17, 1913 in Strasbourg, 
Germany (now part of France). 
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5.7 An Application—Primality Testing 


La dernier chose qu’on trouve en faisant un ouvrage, est de savoir celle qu'il faut 
mettre la premiere. 
The last thing one knows in constructing a work is what to put first. 
From Section I, no. 19 of Pensés (1670), ed. I. Brunschvieg (1909) 
Blaise Pascal (1623-1662) 
French mathematician, physicist, and moralist 


In this last section of chapter five, we look at an application of the contents to primality 
testing. By a primality test, we mean an algorithm that determines whether a given n € N 
is prime. In this section, we look at a primality test described by Lenstra in [42]. This 
algorithm relies upon arithmetic in abelian extensions of Q, and certain residue symbols. 
Hence, this may be viewed as an introduction to Chapter 6, as well as an application of the 
results of this chapter, including the Artin symbol and the Kronecker-Weber Theorem—see 
Definition 5.14 on page 229 and Theorem 5.23 on page 244. 


The genesis of primality testing may be said to originate two hundred years before Christ 
with the Sieve of Eratosthenes—sce [53, p. 32]. There is also the observation attributed to 
Fibonacci that a composite n € N has a prime divisor less than ,/n. Another classical test 
given by Wilson’s Theorem says that 


n € N is prime if and only if n | [(n — 1)! + J]. 


However, each of these three tests is highly inefficient. In other words, there is no known 
way to compute 
(n—1)!+1 (mod n), 


for instance, in reasonable time for large values of n. Gauss computed large tables of 
primes, which provided enough data for him to conjecture the Prime Number Theorem—see 
Theorem A.28 on page 343. Gauss himself recognized the importance of factoring and 
primality testing, citing these being among the most important problems in arithmetic—see 
$4.4 for an overview of factoring. In the twentieth century, the pioneering work of D.H. 
Lehmer produced a school of thought in computational number theory that led to an array 
of very clever ideas for factoring and primality testing—see Biography 5.5 on page 259. 


There are numerous primality tests both classical and recent. There is the Elliptic curve 
test, which the reader will find in [54], the Lucas-Lehmer test, Pepin’s test, and Pocklington’s 
Theorem, the details, for the latter three, which the reader will find in [53]. See also [71] 
for a detailed history of primality testing. 


The test to be described in this section is based upon the following obvious result. 


Theorem 5.24 Criterion for Primality 


If n € N with n > 1, then n is prime if and only if every divisor r of n is a power of n. 


Of course, in practice, primality tests do not directly check that divisors of n are powers of 
n. However, this is done for images of r and n in certain groups G. Given a number n € N 
to be tested, we proceed as follows. Set 


S={reN:r|n}. 


256 5. Ideal Decomposition in Number Fields 


There are three stages in primality testing algorithms based upon Theorem 5.24 on the 
previous page. They are described as follows. 

Stage 1. This stage consists of finding a group G and a natural map o from 8 to G with 
the property that o(rir2) = o(r1)o(r2) whenever 71,12 € 8. For instance, G = (Z/sZ)* for 
some s € Z such that gcd(s,n) = 1 and o(r) =7, where 7F is the least positive residue of r 
modulo s, will suffice. 

In the tests described below, G will always be Gal(/‘/Q) for some finite abelian extension 
K of Q such that gcd(Ax,n) = 1. By the Kronecker-Weber Theorem, there is an s € N 
such that kK C Q(¢s) with gcd(s,n) = 1. Let 0 € Gal(Q(¢;)/Q) defined by 0(¢,) = ¢f 
for a given r € 8. Then define o(r) = 6|x. Observe that o(rir2) = o(ri)o(r2). Also, by 
Corollary 5.8 on page 210, r is unramified in K for any prime divisor r of n. Thus, if r is 


prime we may view o(r) as the Artin symbol (42). 


For any r € 8, we define 
K°) = {ae K:a?™ =a}, 


and observe that if r is prime, then 
K°) = Z,(K/Q) 


see Definition 5.12 on page 221. 


Stage 2. This stage consists of showing that o(r) is a power of o(n) for any r | n, and we 
clearly may restrict our attention to prime divisors of n. In practice, this stage consists of 
putting n through a number of pseudoprimality tests—such as the Miller-Selfridge-Rabin 
test—[53, p. 119]— satisfying the properties: 


(a) It is known that n passes the tests if n is prime. 


(b) Ifn passes the tests, then we may conclude that o(r) is in the subgroup of G generated 
by o(n) for all divisors r of n. 


In the tests described below, this stage will consist of looking for a ring homomorphism, 
w : D yeo(n) b> Z/nZ, 


with w(1) = 1. To show that the finding of such a homomorphism will do the job described 
above for stage 2, we first show that when n is prime that such a homomorphism exists. 
Then we show that indeed its existence implies that o(r) is in the subgroup of G generated 
by a(n) for all divisors r of n. 


Given that n is prime, o(n) is the Frobenius automorphism, or Artin symbol which generates 
the decomposition group of n in K/Q. Therefore, by part (b) of Corollary 5.17 on page 227, 
the decomposition field of n in K/Q, 


K"™ = Z,(K/Q), 


is the largest subfield of K in which n splits completely. Therefore, there exists a prime 
D o(n)-ideal p above n such that 


Doin) /P > Z/[nZ °F, 
is the residue class field, so we have the existence of a ring homomorphism 


w 7 D join) > Z/[nZ 
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—see Definition 5.1 on page 182 and Diagram 5.2 on page 228. However, if such a w exists, 
this does not ensure that n is prime. The methods for finding such a w usually detect 
a composite number, for example by finding an integer a such that a” 4 a(mod n)—see 
Exercise 4.31 on page 164. However, there exist composite integers such as n = 561 = 
3-11-17 for which a” = a(mod n) for all integers a—see Exercise 5.51 on page 260. 


Suppose now that we have found such a ~ in stage 2 (and we assume that we can do so in 
computationally feasible time).°'9 Let r be a prime divisor of n, and let 


p:Z/nZ+ Z/rZ 


be the canonical map. Form pow: Ox +> Z/rZ, which is a ring homomorphism. Thus, 
the kernel of po w is an ideal R in Ooin), and since 


D icon) /R = Oxon) /ker(p oy) = img(p oy) = Z/rZ, 
then & is prime. Since ® is of degree one, then 
K*™ ¢ Z,(K/Q). 


Thus, K?” is fixed by o(n), and Z,(K/Q) is fixed by o(r). Thus, by Theorem 5.21 on 
page 239, (a(r)) C (a(n)), as desired. Thus, we have shown that the existence of such a w 
guarantees that (b) above holds. 

Stage 3. Use the information in Stages 1-2 to finish the primality test. In other words, 
the information will verify that n is prime or it will determine that it is composite. 


The following is an application of the above primality test. 


Example 5.15 Let n € N be given, and let s be the largest divisor of n — 1 for which we 
know a complete factorization. If K = Q(¢,), then by Application 5.1 on page 229, 


Gal(K/Q) = (Z/sZ)*, 


where o(r) € Gal(A/Q) corresponds to 7, with 7 denoting the residue class of r in (Z/sZ)*. 
Since n = 1(mod s), then 


K°M = K, Ox =Z[¢], and me, z(x) = ®,(z). 


If a € Z such that 
a®° =1 (mod n), 


and 
gcd(a’/4 —1,n) =1 


for all primes q | s, then the residue class of the s*” cyclotomic polynomial at a modulo n 
vanishes, namely 
®.(a) =0 


in Z/nZ. Given such a value a, we get a ring homomorphism 7 : Open + Z/nZ by 
mapping ¢, to a. Observe that ¢§ + a® = 1 in Z/nZ. Thus, by the discussion of stage 
2 above, (a(r)) C (a(n)). Therefore, r = 1(mod s) for all r | n. Hence, if s > Vn, it is 
certain that n is prime. This is known as Pocklington’s Theorem see [53, Theorem 2.25, p. 
123]. 


5.19The term computationally feasible or computationally easy means in reasonable computational time. 
On the other hand, problems that are computationally infeasible, or computationally impossible are those 
for which there (theoretically) exists a unique answer, but we cannot find it even if we devoted every scintilla 
of time and resources available. However, it should be stressed here that there is no proved example of a 
computationally infeasible problem. 
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A simple illustration of Example 5.15 on the previous page, is to test the fourth Fermat 
number 
Fy= 2 +1=n 


for primality. Let s = 21°, K = Q(¢,), and select a = 3. Then 
a =3?" =1 (mod n), and a°/? = ae #1 (mod n). 
Hence, by Pocklington’s Theorem, F is prime. 


The main application of Lenstra’s primality test is described as follows. 


Let s € N such that ged(s,n) = 1, where the complete factorization of s is assumed to be 
known. Let t be the order of n modulo s. In other words, t € N is the smallest value such 
that 

n'=1 (mod s). 


Thus, ¢t is the order of n in (Z/sZ)*. For computational purposes, we assume that ¢ is 
relatively small. Let kK = Q(¢,), so 


Gal(K/Q) & (Z/sZ)* 
as above. By Corollary 5.13 on page 218, 


JK: K7™| =¢, 
and 
t-1 
Me, Kein) (Z) = [[@ = 4 ). 
j=0 


It follows from Example 1.22 on page 19 that DO j,«(m) is generated as a ring by the coefficients 
of Me, coin) (x). Thus, to find a ring homomorphism 


w iE D peo(n) b> Z/[nZ, 


it suffices to find a ring extension R of Z/nZ and a homomorphism 


yp: Z[Cs] > R, 


mapping the coefficients of m__ jon) (x) inside Z/nZ. Suppose that we have such a ring. 
To find a, it suffices to find £ 


W(Cs) =aER 
such that a* = 1, a*/4—1 € R* for all primes g | s, and 


If such an element a has been found, there exists a ring homomorphism 
w E D jo(n) b> Z/nZ, 
so from Stage 2, it follows that every r | n is congruent to a power of n modulo s. 


If we assume that s > \/n, then it suffices to try the least residues ni modulo s for 7 = 
0,1,2,...,¢— 1 as possible divisors of n.>-? 

To illustrate the above, we show that the following classical result is a special case of our 
test. 


5.20Tn [42] it is concluded that the expected running time of the algorithm is less than (logn)¢!°s sles” , 
where c is some effectively computable constant. 
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Application 5.4 — Lucas—Lehmer Test for Mersenne Primes 
Let n = 2” —1 with m EN, m > 2. Set e; = 4 and e541 =eF-2ifj> 1. Then n is prime 
if and only if €m—1 =0(mod n). 


To show that this test is a special case of our algorithm, we let s = 2+! and t = 2. The 
interesting case occurs when m is odd (since the case for m even is easy). Define a ring 


where \/2 means 
amt/2 (mod n) € Z/nZ. 


Let @ : Z[¢,] 4 R, as above and set (x) = a. Set b = /2—a = —a™!, which is “the other” 
zero of #7 — \/2x—. in R. By a simple induction argument 


a?’ +b?’ =e; (mod n), (5.53) 


for 7 € N. If n is prime, then R is a field in which a and b are conjugate, so a” = b by the 
theory of finite fields—see §2.1. Hence, 


+a-2”""" = 0(mod n). Conversely, 
2”"*" = 1(mod n). Thus, a°/? —1= 


so from (5.53), we get, €m—-1 = a2” +0?” =a 
if €m—1 = 0(mod n), then a?” = —1(mod n), so a® =a 
—2¢ R*. From a” = a?" ~! = —a~! = b(mod n), we get 


(a — a)(a — a”) = (aw —a)(x— b) (mod n), 


and 7 
(x — a)(x —b) =a? —-V2x-1€ (zl: 


Hence, these conditions guarantee that there exists a ring homomorphism 
D yeoin) b> Z/nZ, 


via Stage 1, and that every divisor of n is congruent to 1 or n modulo s. Hence, for s > n, 
we get that n is prime. 

The test in this section can be used with that given in [1]. The reader is encouraged to solve 
Exercise 5.52 which opens the door to understanding the concepts used in [1], which also 
employs Artin symbols. Furthermore, the solution of Exercise 5.52 generalizes the notion 
of a quadratic Gauss sum given in Exercise 5.33 on page 232, and prepares the reader for 
Chapter 6 where we look at Reciprocity laws and residue symbols in general. 


Biography 5.5 Derrick Henry Lehmer (1905-1991) was born on February 23, 
1905 in Berkeley, California. He got his first degree from the University of 
California there in 1927. Then he achieved his Sc.M. from Brown University 
in 1929. Perhaps the best insight into his contributions may be seen in his col- 
lected works [36]. He was truly a pioneering giant in the world of computational 
number theory, and was widely respected in the mathematical community. He 
was also known for his valued sense of humour, as attested by John Selfridge 
in the forward to the aforementioned collected works, as well as by one of 
Lehmer’s students, Ron Graham. In particular, Selfridge concludes with an 
apt description of Lehmer’s contributions saying that he “has shown us this 
beauty with the sure hand of a master.” He died on May 22, 1991. 
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5. Ideal Decomposition in Number Fields 


Exercises 


5.01. 


5.92. 


5.93. 


5.04. 


Prove that 2°°! = x(mod 561) for all « € N. 


(The value 561 is the smallest Carmichael number, which is a composite integer n € N 
such that a"~! = 1(mod n) for alla € N such that gcd(a,n) =1. They are also known 
as absolute pseudoprimes. We have occasion to use this in tect—see page 257.) 


Let g = p” where p is prime and n EN. If x is a character on Fj and a € F@, then 
= Teg ft, (20) 
F ra 
Galx) = DIx(@)GQir™, 
x=0 


is called the Gauss sum on Fy belonging to the character x. (Recall that the trace 
of an element Tp, /7,(@) = a a”, the sum of its conjugates over the base field, 
which is essentially the same as that given in Definitions 2.4 on page 65 and 5.2 on 
page 184 for number fields.) 


Prove that for any a € Fj and any character x on F%, 
Ga(x) = x(a" )G(x), 
where G(x) = Gi(y). Conclude in particular that 


Ga(e) = 0. 
Prove that if a,x,y € Fy, then 


1 F ar— 
+ Si Catal (x—y)) = bn,y; 
q ack, 
where dz,, is the Kronecker delta—see Theorem 5.7 on page 199 and Exercise 5.52. 
Suppose that y # € in Exercise 5.52. Establish the following generalization of Exer- 
cise 5.34 on page 232: 
IGa(x)| = V4. 


In particular, conclude that for any a € F7, we have 


Galx)Ga(x) = x(-1e. 


(Hint: Use Exercise 5.53.) 


Chapter 6 


Reciprocity Laws 


Laws are like cobwebs, which catch small flies, but let wasps and hornets break through. 
from A critical essay upon the faculties of the mind (1709) 

Jonathan Swift (1667-1745) 

Anglo-Irish poet and satirist 


It may be said that the story of reciprocity laws is intimately linked with the history 
of algebraic number theory itself. Indeed, the historical evolution and generalization of 
the quadratic reciprocity law to residue symbols in algebraic number fields, essentially 
from Gauss to Artin, uses the techniques of algebraic number theory as an indispensable 
tool. Hence, understanding reciprocity laws is an integral part of algebraic number theory. 
Thus, we have left this topic to the concluding chapter, albeit we have already had a solid 
introduction via Definition 5.14 on page 229, Applications 5.1—-5.3 on pages 229-231, and 
Exercise 5.36 on page 232, as well as the applications in §5.7. Furthermore, we motivated 
this chapter with the generalization of the quadratic Gauss sum given in Exercise 5.52. 
Since we have already dealt with the quadratic reciprocity law, as mentioned above, we 
begin with the next level up. 


6.1 Cubic Reciprocity 


Reciprocity laws arise from the following question. Given a fixed n € N, for which primes 
p, is there a solution « € Z to the congruence 


x” =a (mod p), 


where a € Z is known? More generally, we have the following. 
Definition 6.1 — Power Residues 


If m,n € N and a € Z with gced(a,m) = 1, then we say that a is an n'” power residue 
modulo m provided that 


nm 


xz” =a (mod m) (6.1) 
is solvable for some x € Z. 
For instance, when n = 2, the residues are called quadratic residues, when n = 3 they are 


called cubic residues, when n = 4, they are called quartic residues, also called biquadratic 
residues, when n = 5 quintic residues, when n = 6, sextic residues, and so on. 


261 


262 6. Reciprocity Laws 


When n = 2 and m is a prime, we get Gauss’s Quadratic Reciprocity Law discussed in 
the preamble to this section on page 261. In this section, we study n = 3, called cubic 
reciprocity, predicted by Gauss. Eisenstein gave the first published proof of the Cubic 
Reciprocity Law in 1844—see Biography 3.10 on page 137. In this section, we will provide 
one of Eisenstein’s proofs of this law.®! First we need the following preliminary result, 
which is based upon ideal congruences introduced and explored in Exercises 4.25—4.32 on 
pages 163-164 with which the reader should be familiar at this juncture. 


Remark 6.1 Note that by Corollaries 1.1 on page 13 and 1.6 on page 21, Z[¢3] is a PID, 
equivalently a UFD by Theorem 1.18 on page 29. Thus, in what follows, the congruences 
modulo a prime element 7 of Z[¢3] may be interpreted as congruences modulo the principal 
prime ideal (7). 


Proposition 6.1 — Cubic Congruences 


Suppose that F = Q(¢3) and 7 is a prime element of Dr. If a € Or = Z[¢3] where 
Nr(z) #3 and 7 { a, then there exists a unique nonnegative integer n < 2 such that 


alNr(™)-1)/3 = (2 (mod zn). 


Proof. Since 
2 


aNr@-1_ 4 [[e@re-ve = G3), (6.2) 

j=0 
then given 7 { a, we must have that 7 divides one of the factors on the right side of (6.2). If 
m divides two of these factors, then 7 divides the difference of them. The possible differences 
are +(1 — ¢3), (1 — G3), and +¢3(1 — ¢3), and by Exercises 2.24 on page 68 and 3.37 on 
page 129, the absolute value of the norms of any of these elements is 3. Therefore, by 
Exercise 2.46 on page 86, via Remark 6.1, Nr(7) | 3, a contradiction since Nr(r) £ 1,3. 


Proposition 6.1 provides the evidence that the following is well-defined. 


Definition 6.2 — Cubic Residue Symbol 
Suppose that F = Q(¢3) and a is a prime element of Or with Nr(z) 43. Ifa € Op, then 


(*)3 is defined by 
(=), =0ift a, 
(<) =@ifmta, 


where n is the unique integer determined by the congruence in Proposition 6.1. 


and 


If 6 € Or is a nonzero, nonunit element, and 


m 
b= [Im 
j=l 


6.1 Jacobi had already worked out the laws in 1836 and written them down in notes for lectures given at 
K6nigsberg in late 1836 and early 1837. In 1846, Jacobi even went so far as to write in a footnote of a paper, 
which was a republished version of an 1837 paper, that Eisenstein had gotten the proof from Jacobi’s notes. 
Eisenstein responded in a paper published in Crelle’s Journal in 1847 that he had neither seen Jacobi’s 
lecture notes, nor was he aware of the proofs in them. See [12] for more historical details. Two proofs, 
essentially those of Eisenstein and Jacobi, can be found in [32]. 
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where 7; are prime elements of Op with Ne(a,;) #3 for j = 1,2,...,m, then (3) is defined 
by 


If 6 € Uo,, then set 


for all nonzero a € Dp, and 


For the following, the reader is reminded of the introduction of Gauss sums and related 
characters in Exercises 5.27—5.34 on pages 231-232. 


Remark 6.2 Suppose that F = Q(¢3), a,8 € Or, 7 is a prime element of Op, and (©); 
is the cubic residue symbol. Then immediately from Definition 6.2, 


01.0), 


and if a = @(mod 7), then 


Therefore (*)3 is a cubic character on the field Z[¢3]/(7) of Nr(m) elements, namely 


xy (a) = (£)3 is a multiplicative character of order 3 on the finite field Fy,.(7). 


By Exercise 6.2 on page 275, (©), = 1 if and only if @ is a cubic residue modulo 7. 
By Exercise 6.5, every nonzero element of Z[¢3] has six associates. Thus, to refine our 
development of the cubic reciprocity law, we need the following notion. 


Definition 6.3 — Primary Cubic Integers 


If t € Z[¢3], then we call r primary if t = +1(mod 3).°? In particular, if 7 is a prime 
element, then it is called a primary prime. 


Lemma 6.1 — Primary Property via Primary Factors 


Let F = Q(¢3) and let a € Op be a nonzero, nonunit element. Then a is primary if and 
only if there exists a decomposition in which all of its prime factors are primary. 


Proof. Let a = PS a; where each 7; is a prime element of Op andn EN. Ifa is primary, 
then 3 { Np(m;) for any 7 = 1,...,n. By Exercise 6.6, we may write 7; = ujp; where 
uj € Up and p; is a primary prime in Oy. Thus, we may write 


n 
II Uj = +63” 
j=l 


6.2Some texts are more restrictive in their definition of primary, namely they define these elements to be 
those 7 = 2(mod 3), instead of = +1(mod 3) (see [32, Definition, p. 113] for instance). However, the 
theory is made simpler by the more general congruence. 
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for m = 0,1,2. Therefore, a = +¢3” jes p;- Since a = +1 (mod 3) and for each j, pj = +1 
(mod 3), then ¢3’ = +1(mod 3). However, ¢; # —1(mod 3) since 1+ ¢3 is a unit by 
Exercise 4.20 on page 162. If ¢; = 1(mod 3), then 1 — ¢3 = 3G for some 6 € Or. Thus, 


1—¢3 = 36 = (1 — ¢3)(1— G3), 


so 1 = (1 — G3) 8 forcing (1 — ¢3) to be a unit contradicting the fact that 3 | (1 — ¢3). We 
have shown that m 4 1. If m = 2, and ¢? = 1(mod 3), then by a similar argument to the 
above, we get that 1—¢3 is a unit and this is a contradiction since 3 divides it. If m = 2 and 
¢? = —1(mod 3), then this contradicts the fact that 1+¢} is a unit. Hence, m 4 2. We have 
shown that m = 0. Therefore, a= + jas pj, a product of primary primes. Conversely, if 
a is such a product, then the product is congruent to +1 modulo 3, so a is primary. 


We need one more concept and the results related to it before establishing the Cubic Reci- 
procity Law. 


Definition 6.4 — Jacobi Sums 
Let x and 4 be characters on F, where q = p” for a prime p and n € N. Then 
q-1 
Ind) = D2 x(z)AQ — 2) 
x=0 


is called a Jacobi sum. If n = 1, we write J; = J for convenience. The order m of the 
Jacobi sum J;,(x,A) is the least common multiple of the orders of y and 4. Therefore, a 
Jacobi sum of order m is an integer in Q(¢,,). 


Lemma 6.2 — Properties of Jacobi Sums 


(a) Let x and A be characters on F, where q = p” for a prime p and n € N. Then 


In(X,A) = 

BON Goa) 

(b) If x = x) is a cubic character on F, where p = 1(mod 3) is prime, and 
I(X:X) = a+ Ws, 


then 3 | b and a = 2(mod 3). 


(c) For any prime p, 
I(e,x) = 0, 


where x is any nontrivial character on F,, and 


J(€,€) =p. 


Proof. (a) We have that 


GOGO) = SF xle pepe S> xaygp'e" 


wey yeF, 


= Exley = (LE eae ayo. (6.8) 


z,yCkg tcFy wx€Fg 


6.1. Cubic Reciprocity 
If t = 0, then 


S> x(2)x(t- 2) = S2 x(#)A(-2) = A(-1) SS xA(a) = 0, 


xeFg xeF, xeF, 
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by Exercise 5.28 on page 232, since yA # «. If t 4 0, then by replacing x by tx, we get, 


S7 x(w)A(t— 2) = S© x(tari)A(t — ta) 


xeFy 2, €F, 
= S> xAé)x(wi)A(1 — 21) = x(t) SS x(a) AC. = 21) = XA) In (GA). 
@1€Fg 21 EF, 


Hence, by substituting the above expression into (6.3), we get that, 


Trg /Fp (t) 


G(x)GA) = $0 AM In (KA Gp 


teF, 


= In(X,A)G(XA), 


which secures (a). 
For part (b), we first need the following. 


Claim 6.1 Suppose that y is a character of order n > 2. Then 

n—-2 ; 

G(x)” = x(-1)p [] F0ax?). 

j=l 

In particular, for a cubic character y = y‘), we have 
G(x)? = PIG): 
By part (a), G(x)? = J(x,x )G(x?). Multiplying this by G(x), we get 
G(x)? = Tx )G(X7 G(X) = TOG )IGx 7) G(X’). 

Continuing in this manner, we see that 


GO =I) TOG) IO EGP): 


= X by Exercise 5.27 on page 231, then 


G(x" )G(x) = GR)GE(x) = x(-Lp, 


Since x"~! = x71 


where the last equality follows from Exercise 5.54 on page 260. Therefore, G(x” 


x(—1)p/G(y), and substituting this into (6.4) yields Claim 6.1. 
By Exercise 5.52, 


p-l p-l 
G(x)? = D0 xP @)G" = DG" = -1, 
x=0 r=1 


(6.4) 


where the last equality follows from Example 1.5 on page 2, and the penultimate equality 


comes from the facts that +(0) = 0 and y3(x) = 1. Therefore, by Claim 6.1, 


G(x)? = pJ(x,.x) =a+b¢3 =—1 (mod 3). 


(6.5) 
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By part (c) of Exercise 5.27 on page 231, 
G(x)? = pJ(X,X) = a+ bs = —-1 (mod 3). (6.6) 
Thus, subtracting (6.6) from (6.5) yields, b(¢3 — ¢3) = 0(mod 3), namely 
b/—3 =0 (mod 3), 
which implies that 3b? = 0(mod 9), thereby forcing 3 | b. Hence, from (6.5), 
a+6¢3 =—-1 (mod 3), 
so a = 2(mod 3). 
In particular, if y is a cubic character, then 
G(x) = x(—DpJ (xx) = pPIOGX), 


since yx(—1) = x((—1)?) = x3(—1) = 1. This secures part (b). 
(c) The first assertion is immediate from Exercise 5.31 on page 232 and the second assertion 
is immediate from Definition 6.4. 


Lemma 6.3 — Cubic Jacobi Sums 


Suppose that F = Q(¢3), 7 is a primary prime element of Or, and Nr(z) = p = 1(mod 3). 
Then 


(3) (3), __ J —m ifm=1 (mod 3), 
Joe > Xa =| T if7m7=-—l (mod 3). 


Proof. For the sake of simplicity of notation, we set x8 = = x for the balance of the proof. 
Let J(x,x) = a+ b¢3. By part (b) of Lemma 6.2 on page 264, a = 2(mod 3) and 3 | b. 
Also, by Exercise 5.54 on page 260 and Claim 6.1 


Ne(J(x,x)) = Ne(G(x)*/p) = Ne(x) =p. 


Therefore, J(y,x )J(x,N ) = p = 17, so 7 | J(x,x ) Or 7 | J(x,x ). We now show that the 
former holds. We have, 


p-1 
=F x0 (1 — 2) = aD — gy “0/3 
«x=0 


at (p-1)/3 
=S 2s & ) (—2)i 


j=0 z 


1 


(p—1)/3 ( 1)/ Pp— 
= S- cs “V1 Nae? (p-1)/3+3 (mod 7), 
J 


j=0 


where the middle congruence comes from the Binomial Theorem. Also, we have that 
(pa is+g<p—1 


for j = 0,1,...,(p — 1)/3, so by Exercise 6.13 on page 277, S~?=5 a@—)/3+) = 0 in Fy. 
Hence, J(y,x ) = 0(mod 7). Since 7 | J(x,x), which is itself a primary prime element of 
Op given that J(x,x ) = a = —1(mod 3), then J(y,x ) = £7, and the result follows. 
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Theorem 6.1 — Cubic Reciprocity Law 
Let a,6 be relatively prime primary elements of Or where F = Q(¢3). Then 


(3),- (2) 
BY a) 
Proof. If a € Uo,, then a = +1 since a is primary and ‘a 1(mod 3) for 7 = 1,2. 


Thus, by definition, 
Ole 
ee ee a ey 
BY O73 


If a is not a unit, then by Remark 6.2 and Lemma 6.1 on page 263 it suffices to prove the 
result for the case where a,(@ are primary primes. 


Case 6.1 a,@ are rational primes, which are inert in F’.. 


Since a and # are relatively prime, then by the last statement of Exercise 6.4 on page 275, 
§).--(9), 
BY OF-8 


= 


This completes Case 6.1. 


For a given prime element 7 of Or, set yx (¥) 7) = (2)3 in what follows. 
Case 6.2 a = q = 2(mod 3) is inert and 6 =a with Nr(7) = p= 1(mod 3). 


By Lemma 6.3 and Claim 6.1 on page 265, 


COa)? =F Cue %e)-— tom: 


Therefore, G(yx)% ~! = (pr)V -Y/3 = yq(pm) = xq() (mod q), where the last congruence 
follows from Remark 6.2 on page 263 since p = 1(mod 3). Thus, 


2 


G(X)" = Xq(™)G(Xx) (mod q). (6.7) 


However, by the Multinomial Theorem—see Theorem A.25 on page 341— 


Save yor" ” (mod q). 


Also, since gq? = 1(mod 3) and y,(a) is a cube root of unity, then 
p-l 
2 
S> Xm ()CB Ss Xn (x Ga2(xr), (6.8) 
x=0 
and by Exercise 5.52 on page 260, 
Gq2 (Xn) = Xx(V)G(xx): (6.9) 


Combining (6.7)—(6.9), we get 


Xn (GP )G(Xr) = Xq(7)G(xx) (mod gq). 
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However, by part (c) of Exercise 5.27 on page 231, 


so 
Xn(QG(Xn) = Xq(7)G(Xx) (mod q). 


Since G(x)G(X,,) = p from Exercise 5.54 on page 260, then multiplying the latter congru- 
ence by G(X,,), we get 
Xn(Q)P = Xq(7)p (mod q), 


so we may divide out the p and use the uniqueness given in Definition 6.2 to conclude that 


Xn(Q) = Xq(7), 


which completes Case 6.2. 


Case 6.3 Assume that a = 7 with Ne(m) = p = 1(mod 3) and 6 = m with Np(m2) = 
q = 1(mod 3), and p¥ q. 


Since 7 is primary, then as in Case 6.2 


G(xm, 271 = (4p71)-Y9 = Xang (p71) (mod 72). 
In other words, 
G(xa1)* = Xa (p™1)G(xw,) (mod 72). (6.10) 


However, as above, by the Multinomial Theorem and Exercise 5.52, 


Gxm)? = D0 x4, OG? = Gy(xm) = Xe. (DE(xm) (mod q), (6.11) 
x=0 


where Xz, (q) # 0 since p 4 q. Comparing (6.10) and (6.11), and using part (c) of Exercise 
5.27 we get, 


Xa (PT1) = Xx, (9) = xm (4) = xm (9)? = xm (97). 
We have shown that 
Xo (PT) = Xm (9"). (6.12) 


Now we repeat the above argument that led to (6.12), with the role of 72 replacing that of 
71, and 7 replacing that of 72. Then instead of (6.12), we get 


Xi (qr) = Xr2 (p*). (6.13) 
Also, by Exercise 6.4 on page 275, and part (c) of Exercise 5.27, 
xm (9°) = Xn, (4°) = Xm (9): (6.14) 


Multiplying (6.12) by yx, (72) we get, 
Xm (T2)Ximo (D1) = Xara (T2) Xa (7): (6.15) 
Also, by multiplying (6.14) by xz, (72), the latter equals, 


Xa (2) Xai (G) = Xi (G72), (6.16) 
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and by (6.13) , 
Xrg(D") = Xrg(DT1T1) = Xrrg (1) Xo (VT). (6.17) 
Hence, from (6.13)—(6.17), 
Xa (12) X mo (p71) = Xe T™1)Xar9 (p71). 
Dividing out by x7,(p71) yields, 


Xr (2) = Xt 74), 


which establishes Case 6.3. 


We have proved the Cubic Reciprocity Law for all except the following (often overlooked) 
case. 


Case 6.4 °? Assume that a= 7 ¢Z is a primary prime and B =F. 


By Remark 6.2 on page 263 and the cases already proved, 


eG)! 


since —1 is a cubic residue, and since —7 = 7(mod 7 +7), then this in turn equals, 


and the proof is complete. 


Corollary 6.1 Let p = 1(mod 3) be a prime with p = 77 where 7 = a+ b¢3 € Z[¢3]. If 
A, B € Z are such that 
4p = A? +2737, 


then A is a cubic residue modulo p. 


Proof. Since 


by Exercise 6.4, then necessarily 


@,-,- 9 


by the cubic reciprocity law. Also, we have, 
r+T=a+be3+a+bG =2a- 6, 
where the last equality comes from Example 1.5 on page 2. 
Claim 6.2 For any prime p = 1(mod 3), there are unique A, B € N such that 
Ap = A® + 27B?, 


with A = +1(mod 3). Here we say “unique” in the sense that, although —A and —B will 
also satisfy the equation, they are not natural numbers. Usually, one says that the A and 
B are unique “up to sign.” Our choice of only the positive sign by selecting only natural 
numbers ensures uniqueness of sign. 


6-3The elegant proof of this case is due to Ron Evans, who suggested it in the writing of the first edition, 
as is the suggestion of Corollary 6.1, an application of cubic reciprocity based upon this case. 
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By the proof of part (b) of Lemma 6.2 on page 264, 


J(x,x) = at b3 


with 
JOux)|? =p 


and 
p=a*—ab+b?’. 


Set A = 2a—b €N and B = |b|/3. Then 
Ap = (2a — b)? + 3b? = A? + 27B?. 
Since 3 | b and a = 2(mod 3) by part (b) of Lemma 6.2, then it follows that 


A= |2a — b| =+1 (mod 3). 


Uniqueness of representation is shown by choosing A to be the smallest such value, from 
which it follows that there can be no other representation of this type. This is Claim 6.2. 


By Claim 6.2, 
Ap = 4(a + b¢3)(a — b¢2) = 4a” — 4ab + 4b? = (2a — b)? + 30? = A? + 27B?. 


Therefore, from (6.18) and the fact that 7 + 7 = A, we get: 


That A is a cubic residue modulo p will follow from the next result. 


Claim 6.3 For a given 6 € Z[¢3], 6 = y3(mod p) for some y € Z[¢3] if and only if 6 = a3 
(mod 7) and 6 = 6? (mod 7) for some a,3 € Z[¢3]. Furthermore, 5 = y?(mod p) for some 
7 € Z[¢3] if and only if 6 = a? (mod p) for some a € Z. 


By Theorem 1.21 on page 32, for any a, € Z[¢3], there exists a y € Z[¢3] such that 
y=a (moda) andy=8 (mod7), 
from which the first result clearly follows. For the other assertion, we note that 
Op/n =LlCs]/0 & Z/pL, 


since Np(m) = p—see Definition 5.1 on page 182. Therefore, given y € Or, there exists 
a rational integer a such that y = a(mod 7). Thus, if 6 = y?(mod p), then we have that 
5 = a®(mod p). The converse is trivial, with a = 7. 


Example 6.1 Let p = 19. Then 4-19 = 7? + 27. Thus, by Corollary 6.1, there exists an 
x € Z such that 7 = x? (mod 19). In fact, 7 = 4° (mod 19). 


Corollary 6.1 is an application of Case 6.4 in the proof of Theorem 6.1. We may exploit 
that case further to motivate another application of the Cubic Reciprocity Law. 
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Example 6.2 Let a = 24+ 3/-3 = 7 and 6 = 2- 3V-3 = 
elements of Z[¢3] = Z[(1 + /—3)/2] = Or, and Ne(7r) = Npe( 
explicitly illustrate Case 6.4, we have, 


gq Ne(m)-1)/3 — (24 3,/—3)9 =1=¢2 (mod 7), 


, both clearly primary 
) = 31 = p. Also, to 


T 
T 


since 
(2 + 3V/—3)!° = 24663337 — 8393412/—3 = 1 + (2 — 3V—3) (4027980 + 1845264V/—3), 


so (2 — 3\/—3)!9 = 1 = C2 (mod 7) as well, since 


(2 — 3/—3)'° = 14 (2 + 3V—3) (4027980 — 1845264,/—3). 


(2),- 5), -1- (ss), - 


p= 31 = 27 +27, 


Thus, 


Notice that 


a case not covered by Corollary 6.1, in the sense that we cannot use it to determine if 2 is 
a cubic residue modulo p. The following result does tell us how to determine when 2 is a 
cubic residue modulo p in the general case. 


By Exercise 6.2, we know that a € Or = Z[¢3] is a cubic residue modulo a prime element 
nm € Or if and only if (¢)3 = 1. In particular, the cubic residuacity of 2 is of special 
importance from both a historical perspective and from the point of view of representation 
of rational primes as norms of cubic integers. In order to establish such results we need the 
following, which was proved by Gauss in 1801—see Biography 3.5 on page 95. 


Theorem 6.2 — The Cubic Residuacity of 2 
Let p = 1(mod 3) be a prime and let 4p = a? + 3b”, where a = 1(mod 3) and b= 0(mod 3) 
are the unique natural numbers determined in Exercise 6.10 on page 276. Then 


2=-x° (mod p) for some x € Z if and only if 2 | a. 


Proof. Since p = 1(mod 3), then it follows from Remark 1.24 on page 52 that p = m7’ 
where 7 is a prime element of Or = Z[¢3] and 7’ is the algebraic conjugate of 7. We may 
let 7 = (a + bV—3)/2 since 


2 3b2 
Np(n) = a7 =p= ——. 

If 2 = 2° (mod p), then 2 = x3 (mod 7), so 
at’ = = 7 =1 (mod 2), 


by Exercise 6.3. This in turn holds if and only if 


a+b 
2 


=1 (mod 2) and b=0 (mod 6). 


Together, these imply that 2 | a. 
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Conversely, if 2 | a, then necessarily 2 | b, so we may write a = 2a, and b = 2b,. Therefore, 


ot iVS ya (m—t tvs 
- 2 mes 3 


) =1 (mod 2), 


so by Exercise 6.3 and Claim 6.3 on page 270, 2 = x? (mod p) has a solution x € Z. 


Remark 6.3 The reader is cautioned that ( *)3 = 1 does not necessarily imply that 2 = x? 


(mod p) has a solution « € Z. Exercise 6.4 tells us that in fact (2)s = 1 for any prime 
p > 3. Consider the following example, which motivates the next result conjectured by 
Euler. 


Example 6.3 Let p = 7, so p = 17, in Or = Z[¢3], where 7 = (5 + V—3)/2. Thus, one 
calculates that 


(2) = 9(Ne()-0/3 — 4 = ¢, (mod 7), 
3 


TT 


since 


4 


sees (G4?) (44) -a-m, 


where y = (—3 + V—3)/2. Similarly, 


Therefore, 


oe) 7 ey eh = (3¢3 = 1. 


Yet, 24 x°(mod p) for any x € Z, by Theorem 6.2, since 2{ a= 1 where 4-7 = 1743-37. 


Theorem 6.3 — Prime Representation and Cubic Residuacity®* 


If p is a rational prime, then there exist 2, y € Z such that p = «7 + 27y? if and only if p=1 
(mod 3) and 2 = z3(mod p) for some z € Z. 


Proof. If p = 1(mod 3) and 2 = z3(mod p) for some z € Z, then by Theorem 6.2 we have 
that 2 | a, so p = a + 27y?, where « = a/2 and y = 6/6. Conversely, if p = x? + 27y?, 
then certainly p = 1(mod 3). Since 4p = (2x)? + 3(6y)?, then by Theorem 6.2, we have the 
result. 


In the next section, we will see Gauss’s proof of another of Euler’s conjectures, this time 
using biquadratic reciprocity. 


Example 6.4 Returning to a consideration of Example 6.3, we see that 7 certainly cannot 
be represented in the form x? +27y?. What is hidden here is that in the non-maximal order 
Z|V—27|—see Remark 3.5 on page 99—the ideal P = [7,1 + V—27] is not principal. In 
fact, if p = 1(mod 3) is prime, then by Exercise 6.1, there is a rational integer b such that 


6-4Fuler conjectured this in Tractatus de numerorum doctrina capita sedecim quae supersunt, which he 
wrote during the years 1748-1750. However, the work was not completed and did not get published until 
1849 (see [18]). Gauss was the first to prove the result as a consequence of his work on cubic residuacity 
including the result in Theorem 6.2. 
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—27 = b* (mod p). Thus, by Exercise 6.14, there exist 2, y € Z such that p = x7+27y? if and 
only if the ideal P = [p, b + /—27] is principal in Z[,/—27].°° Therefore, P = [7,1 + /—27| 
is not principal in Z[/—27]. What Example 6.3 shows is that p = 7 is a product of prime 
elements 77 in Z[¢3] where 1,7 ¢ Z[/—27]. Thus, Euler’s criterion given in Theorem 6.3 
says that primes p = 1(mod 3) are representable in the form p = x? + 27y? if and only if 
am € Z|,/—27]. Thus, we have a surprisingly simple interpretation in terms of ideal theory. 


Excluded from the Quadratic Reciprocity Law is the fact (given in (A.10) on page 342), 


that 
(j) - Deve, 
Pp 


which is therefore called the Supplement to the Quadratic Reciprocity Law. We close this 
section with the cubic analogue of this fact. 


Theorem 6.4 — Supplement to the Cubic Reciprocity Law 
Let t = —1+3m+43n¢3 € Or = Z[¢3] be a primary prime element.°° Then 


(—*) _ 72m 
T oo 
3 


Proof. Suppose first that 7 = q = —1(mod 3), so g = —1+ 3m. Since 


(1—¢3)? = —3¢g, (6.19) 


then we need only show that e= )3 = ¢3”. By the last statement in Exercise 6.4, 


By of S83) (EN 21) 2 eS 2 7 8 Bie oh 
( T ),-(), (8), (2). 3 63 3 G3". 


Now we let a = 3m — 1 and b = 3n where n ¥ 0, and ged(a,b) = 1, so 7 = a+ b¢3 with 
= 6.7 
p= TT. 


Claim 6.4 


This follows from the fact that 
C3. cheb 
rom — ¢3 ’ 
3 


T 


and (p — 1)/3 =n-—2m=m-+n(mod 3), since p= Np(m) = a? — 2ab + b?. 


Claim 6.5 


6.5This phenomenon is studied in detail in [49]. Therein, such so-called non-mazimal orders as Z[,/—27| 
and their relationship with the maximal order or ring of integers such as Z[¢3] = Z[(1 + V/—3)/2] is explored 
in depth. See also, [50, pp. 349-352] for an overview of the above from an elementary standpoint. 

6-6 There is no loss of generality in assuming that s=—1(mod 3) since one of + must satisfy the congruence. 
Also, by Example 1.5 on page 2, there is no need for a term involving ¢? in the given expression for 7. 

6.7 The ideas used in the balance of the proof are due to K. S. Williams [72]. 
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By the Cubic Reciprocity Law and the last statement in Exercise 6.4, 


ay _(T\ _ (atbls\ _ (bGs\ _ (6 G3) .(a?=1)/3 
(2),-(2),=( a ).-(@) -@).(@),-4 
However, since (a? — 1)/3 = m(mod 3), then we have Claim 6.5. 
Co ae 
3 3 
Since (a + b)¢3 = —a(1 — ¢3) (mod 7), then 


(ot) _ (e488) = (aa) _ @): (*) (: =) 


so by Claims 6.4-6.5, this equals 


Co grtm (1) = » (=) 
3 $3 - —~ $3 1 ’ 
a 3 


which establishes Claim 6.6. 


Claim 6.6 


Claim 6.7 


Tw = are 
at+b/. : 


Since 7 = —b(1 — ¢3) (mod a + b), then 
us oof SU): oof ad Ley. 2f PG 
a+b}. a+b 3 at+b/,\at+b/, abo fs” 
where the last equality comes from the last statement in Exercise 6.4. However, by (6.19), 
PG \, uf Gey 2 —363 es -3\*/ @ \? 
at+b/., a+b /, at+b/., at+b/, \a+b @. 
and by the last statement in Exercise 6.4 again, this equals 
( 3 ) : a 3 ((a+b)?—1) _ -m+n 
wae © — S38 —_ ¢3 b) 
3 


which completes the proof of Claim 6.7. 
By Claims 6.6-6.7, and the Cubic Reciprocity Law, 


_ b TT m+n m 
Se ee a i 


which establishes the result. 
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Exercises 
6.1. Let F = Q(¢,) where n € N, a € Op, 7 a prime element of Or with Npr(7) = q, and 
g = gcd(n,q— 1). Prove that 
x” =a (mod 7) 
has a solution « € Or if and only if 
alt-)/9 =1 (mod 7). 
(Hint: Use Theorem A.8 on page 331 and Theorem A.24 on page 340.) 
6.2. Suppose that F = Q(¢3), a,8 € Or, m is a prime element of Op, (&) is the cubic 
residue symbol given in Definition 6.2 on page 262, and Nr(a) 4 3. Prove that 
(=) = 1 if and only if 63 =a (mod 7), 
1/3 
for some 8 € Op. 
(Hint: Use Exercsie 6.1.) 
6.3. Let F = Q(¢3) and a is a primary prime in Dp, with a ¢ Z. Prove that 
6B? =2 (mod nz) 
has a solution 8 € Or if and only if 
mt =1 (mod 2). 
(Hint: Use Proposition 6.1 on page 262, Exercise 6.2, and the Cubic Reciprocity Law.) 
6.4. Suppose that a,8 € Z[¢3] = Or where a is a prime element with Nr(a) 4 3 and 
a{ G. Prove that 
x, (8) = xe 8). 
Use this to deduce that for r,s € Z with gcd(r, s) = 1 and 3{ s, 
hee 
8/3 
(Hint: Use Exercise 6.2.) 
6.5. Prove that every nonzero element of Z[¢3] has six associates. 
6.6. Let F = Q(¢3), and let a € Dr be a nonzero element such that 3{ Ne(a). Prove that 
exactly two of the associates of a are primary, and that if 6 is a primary associate of 
a, then —( is the other one. 
6.7. Prove that 


T(x ~") = xt); 
for any nontrivial character y on F, where p is prime. 
(Hint: Use Exercise 5.28 on page 232.) 


Exercises 6.8-6.12 are designed as applications of Jacobi sums to certain Diophantine 
equations, especially over finite fields, not covered in the main text. For more information, 


see [3]. 
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6.8. Let x and \ be characters on F,, where p is prime and y,A,yA # €. Prove that 


JOGA] = VP. 
Use this fact to prove the following. 


(a) Ifp=1(mod 4), then there exist a,b € Z such that p = a? + b?. 
(b) If p=1(mod 3), then there exist a,b € Z such that p = a? — ab + b?. 


(Hint: Use Exercises 5.30 on page 232, 5.54 on page 260, and part (a) of Lemma 6.2 
on page 264). 


6.9. Suppose that 
n 
k; 
F(ti,...,2n) =aot+ Saye 
j=l 
where a; € Fy, n,k; € N, and let Ny be the number of solutions of 
Pl Siseeo5 Lia) =0 
in F,, where g = p™, m € N for some rational prime p.°° Prove that if 
f(x) =a" -a 


for a € FX, n € N and a prime p= 1(mod n), then 


Ny.p = S- x(a), 
j=l 


where x is a character of order n on F,. In particular, use this fact to prove the 


following, where ( 5) is the Legendre symbol. If 


f(a)=2* —a 


N 1+ (4) 
fp p : 


(Hint: Use Exercises 5.80 and 5.82 on page 232.) 


and p > 2, then 


v« 6.10. With reference to Exercise 6.9, suppose that p = 1(mod 3) is prime and 
fay =e? +y-1 


Prove the following result due to Gauss. There are A,B € Z with A = 1(mod 3) 
uniquely determined such that 


4p = A? + 27B?, and Nyy =p—2+A. 


(Hint: Use Exercise 6.7, parts (b)-(c) of Lemma 6.2, and the proof of Claim 6.2 on 
page 269.) 


6.8Equations of this form are called diagonal equations. For an in-depth analysis of such equations, see [3] 
and [43]. 


6.1. 


6.11. 


6.12. 


6.13. 


6.14. 


6.15. 


Cubic Reciprocity 


With the notation of Exercise 6.9 in place, prove that if 
f(y) =a? +y?-1 


and p > 2, then 


Ne. = p-1 ifp=1 (mod 4), 
fe) n+1° ifp=—l1 (mod 4). 


(Hint: Use Exercises 6.7, 6.9, and see the solution to Exercise 5.83 on page 398.) 


Let p = 1(mod 3) be prime and set 
f(z,y) = 2? ty? 1. 
Prove that, in the notation of Exercise 6.9, 


INpp —p+2| < 2y/p. 


Let k € N and p > 2 prime. Prove that 


Let p = 1(mod 3) be a rational prime, and (as shown in Example 6.4), let b € Z such 


that b? = —27(mod p). Prove that there exists x,y € Z such that 
p=2" +277? 


if and only if the ideal 


P= [p, b+ V —27] 
is principal in Z[./—27]. 


With reference to Exercise 6.14, prove that P? ~ 1 in Z[,/—27]. In fact, it can be 
shown that the class number of Z[,/—27] is 3 see [49, Footnote (1.5.9), pp. 25-26]. 


Biography 6.1 Carl Gustav Jacob Jacobi (1804-1851) was born in Potsdam 
in Prussia on December 10, 1804, to a wealthy German banking family. In 
August of 1825, Jacobi obtained his doctorate from the University of Berlin 
on a topic involving partial fractions. The next year he became a lecturer at 
the University of Konigsberg and was appointed professor there in 1831. Ja- 
cobi’s first major work was his application of (his first love) elliptic functions to 
number theory. Moreover, Jacobi and his good friend Dirichlet both generated 
their own brands of analytic number theory. As well, Jacobi was interested in 
the history of mathematics and was a prime mover in the publication of the 
collected works of Euler—a task, incredibly, not completed fully to this day. 
Outside of number theory, he made contributions to analysis, geometry, and 
mechanics. Although many of his colleagues felt that he might work himself to 
death, he died of smallpox on February 18, 1851. 
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6.2 The Biquadratic Reciprocity Law 


The doors we open and close each day decide the lives we lead. 
Flora Whittemore—see [14] 
American Homesteader 


Gauss was the first to state the Biquadratic (or Quartic)®*® Reciprocity Law, but he never 
published a proof. However, in [19, pp. 101-171], he made serious use of complex numbers 
to discuss biquadratic residues.°!° Eisenstein was the first to publish a proof in 1844. 
Indeed, Eisenstein went on to publish five separate proofs of this law between 1844 and 
1847. In order to present a proof in this section, we must develop a theory analogous to that 
developed for the cubic case. We begin with the following quartic version of Proposition 6.1 
on page 262. 


Proposition 6.2 — Quartic Congruences 
Let F = Q(¢4) = Q(t), and let 7 be a prime element of 


If 
aé€ Or, t{aand Ne(r) F¥ 2, 


then there exists a unique nonnegative rational integer n € {0,1, 2,3} such that 


alNe(m)-D/4 = 5” (mod zr). 

Proof. From Exercise 4.31 on page 164, it follows that since F’ is a PID, 
aNr(™)-1 =] (mod x), 

and by Remark 1.24 on page 52, 


Nr(x) =1 (mod 4). 


Thus, a(%7(™)—-0/4 is a root of ct = 1(mod 7), as are +1 and +i. Hence, a\¥r(*)-1)/4 
must be one of i” for 0 <n <3 modulo 7m. 


Proposition 6.2 now allows us to formulate the following. 


Definition 6.5 — Biquadratic/Quartic Residue Symbol 


Let 7 be a prime element in Or = Z[i] with Nr(z) 4 2, and let ac Or. If t+ a, then we 
set 


6.9We will use the terms quartic and biquadratic interchangeably. 
6.10Indeed, Gauss was the first to use the term compler number and introduced the symbol i for ,/—1—see 
[11, p. 254].Thanks to Gopala Srinivasan for pointing out the latter reference. 
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If 6 = m72--- Tm, where each 7; 1 < j < m € Nisa prime element of Op with Np(z;) 4 2, 


ees (2) : ey (2) (=). 


If 6 € Uo,, then set 


for every nonzero a@ € Op, and set 


Now we establish some properties of the quartic residue symbol. 


Proposition 6.3 — Properties of the Quartic Residue Symbol 


Let a,8 € Or = Zi], and let 7 be a prime element of Or. Then each of the following 
holds. 


(a) Ifa, are both nonzero and gcd(a,@) = 1, then 


where Z is the algebraic conjugate of x.°14 


(b) If af a, then 


(<) = 1 if and only if at =a (mod 7) has a solution x € D%,. 
w/a 


© (#),=().G), 


(d) Ifa@= (mod 7m), then (2), = (2). 


Proof. (a) By Definition 6.5 and Proposition 6.2, it suffices to prove this for 6 = 7, a prime 
element of Op, so a\Vr(™)—-1)/4 = (“), (mod 7), which implies 


a(Ne(m)-1)/4 = @). (mod 7). 
TIA 


Also, by Definition 6.5, 


(2) = ger @=1/4 @ g(Ne(n)=0/4 
Ta 


so 


and multiplying both sides by (©) , yields part (a). 
Part (b) follows from Exercise 6.1 on page 275, and parts (c) and (d) are immediate from 
Definition 6.5. 


6.11Note that in this case, = x’ is both the algebraic and complex conjugate of « = a + bi for the specia 
case of the Gaussian integers, since x’ = a — bi = @. 
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Remark 6.4 From Proposition 6.3, we see that (*), is a quartic character on the field 
Z|t|/(x) of Nr(m) elements, called a quartic residue character, and we write 


Now we introduce the quartic analogue of Definition 6.3 on page 263. 
Definition 6.6 — Primary Gaussian Integers 
Ifa =a+ bi € Z[i], then a is said to be primary if 


a=1 (mod 2),b=0 (mod 2) anda+b=1 (mod 4). 


Lemma 6.4 — A Formulation for Primary Gaussian Integers 
a=a-+bi € Zi] is a primary if and only if a+ bi = 1(mod 2 + 22). 


Proof. If a+ b= 1(mod 4), where a is odd and 6 is even, then 


: -l+a+b 1—a+b 
a+bt=1+ | 


4 4 ):) (2+ 21) =1 (mod 2 + 2%). 


Conversely, if a + bt = 1(mod 2 + 27), then there exist c,d € Z such that 


a+ bi =1+ (c+ di)(24+ 21) = 14 2c — 2d +4 (2c + 2d)i. 
Thus, by comparing coefficients, 
a=1+2c—2d=1 (mod 2), b=2c+2d=0 (mod 2), 


and 


a+6= (1+ 2c — 2d) + (2c+ 2d) =1+4c=1 (mod 4). 


Remark 6.5 By Proposition 6.4, the only unit that is primary is 1. Also, by Exercise 6.19 
on page 292, a Gaussian integer not equal to 1 is primary if and only if it can be factored 
into a product of primary Gaussian primes. Also, if a@ is primary, then (1 +7) fa. Any 
Gaussian integer not divisible by 1 +7 is said to be odd. This is in keeping with the fact 
that if (1+) | a, then 2 | Np(a). Given an odd Gaussian integer, exactly one of its four 
associates is primary. This is the quartic analogue of Exercise 6.6 on page 275. 


We now establish some properties of primary integers. 


Lemma 6.5 — Properties of Primary Integers 


Let a = a-+ bi be a primary element of Z[i] = Or. Then 


(=) oe 
Oy 4 


Furthermore, if a = 7 with Ne(z) = p = 1(mod 4) is a primary prime element of Op, then 


DO ie DOr i 
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Proof. First, we show that to prove the first assertion it suffices to prove the result for 
Q=7,a primary prime. Let a=a-+bi and 8 = c+ di be primary Gaussian integers. Then 
b = 1 —a(mod 4) and d = 1 — c(mod 4), which together imply that (1 — a)(1 — c) = bd 
(mod 8). It follows that b+ d= 1-ac+bd(mod 8). Thus, if 


(=) — {1-0/2 and (5) = {(-9)/2, 
a) 4 Bi) 4 


(<) (5) _ j(1-a)/2,(1-c)/2 _ g(b+a)/2 _ {(1—actbd)/2 _ (=) 
al,\B/4 aB } 4 


Hence, we may assume that a = 7 = a+ bi is a primary prime element. Therefore, 


then 


d — j(Nr(®)-1)/4  ;(a?+6?-1)/4 _ j(1-a)/2 
a+bi/, ; 


where the last equality comes from the fact that a? + b? — 1 =2-— 2a(mod 16), given that 
b=1-—a/(mod 4). This establishes the first assertion.® 1? 


To prove the second assertion, we set x = x for simplicity. By Exercise 6.8 on page 276, 


JOxGX )IOGX) = p= 1T. 


By the same reasoning as in the proof of Lemma 6.3 on page 266, J(x,x ) = 0(mod 7). How- 
ever, by Exercise 6.8 again, Nr(J(v,xv )) = p. Therefore, J(x,x ) is prime by Exercise 1.27 
on page 19. Thus, there exists u € Up, such that 


ud (x,x) = 7. (6.20) 


Claim 6.8 
(1)? 9/4 1(y,x) 


is primary. 


Since x(x) = x(p — x) for x = 2,3,...,(p — 1)/2, then by Definition 6.4 on page 264 


(p—1)/2 2 
Joox)=2 Sy xed —2) +x (2) 


Since x(x), (1 — %) € Up,, then 
x(a) =x —2) =1 (mod 2 + 2%). 


Therefore, 


Joox) =2(25*) (Pet) (mod 2 + 2i). 


However, as well we have that 


x S) = x(274)? = x(2)-? = x(-4(1 +6)? = x)? = x((-)) = x(-1), 


6.12We observe that this first assertion is often called one of the supplementary laws to the Biquadratic 
Reciprocity Law—see Theorems 6.7 and 6.8 below for the others. 
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and since p = 1(mod 2 + 22), then 


Joox) =2 (25) + x(-1) =-24+ y(-1) (mod 2 + 2%). 
Therefore, 
—x(-1)J(x,x ) = 2x(-1) —1=1 (mod 2 + 2%), 


where the last congruence follows from the fact that y(—1) = +1. By Lemma 6.4 on 
page 280, —y(—1)J(x,x ) is primary. Since z is primary, then the congruences in Defini- 
tion 6.6 on page 280 imply that 


a+l_ pt+3 


Thus, since —y(—1) = (—1)*+)/?, then (—1)+9)/4 J(y,x ) is primary. This is Claim 6.8. 
Now, by Claim 6.8 and Exercise 6.20 on page 292, u = (—1)*+°)/4, so the result follows by 
multiplying (6.20) on page 281 through by (—1)@+°)/4, 
At this juncture, we have developed sufficient machinery to establish the Quartic Reciprocity 
Law. The following proof is similar to the proofs given by Eisenstein and Jacobi using the 
theory of cyclotomy.®!3 


Theorem 6.5 — The Biquadratic Reciprocity Law 
Let a=a+bi,8 =c+4+di € Zii] such that gcd(a,8) = 1 with both a and § primary. Then 


(3),-(8),c08" wa 


Proof. We break the proof into two cases. 
Case 6.5 a € Z. 


In this case, (6.21) becomes 


5).-(0, wn 


By the factorization property given in Definition 6.5 on page 278, we may assume without 
loss of generality that a = +p, where p is an odd rational prime (since 2 is not primary), 
and ( is a Gaussian prime. The case where { € Z is covered by Exercise 6.16, so we assume 
that 6 ¢ Z and N-(8) = p = 1(mod 4). For simplicity, we set y = x. First we assume 
that a = —q where g = 3(mod 4) is a rational prime. 


By Lemma 6.5 on page 280, 


J(x3x) = UT. 
Also, by Theorem A.25 on page 341, since g = —1(mod 4), 


JI"(x,x) = J(X,X) (mod q). (6.23) 


6.13The term cyclotomy refers to the theory of cyclotomic numbers, which may be defined as follows. 
For a given odd prime power p”=q=kf+1 where f,k,n€N with k>2, fix a primitive root g modulo q. Then 
given integers s and t, the cyclotomic number (s,t), of order k is the number of ordered pairs of integers 
(a,b) with g?*+s+41=g>*+* for 0<a,b<(q—1)/k. Thus, we see that the theory of cyclotomy essentially involves 
consideration of equations of the form az*+by*=1. The theory of cyclotomy was originated by Gauss. 
Kummer first observed the connection between Jacobi sums and cyclotomic numbers. Later, interest in the 
theory was renewed by the work of Dickson, and authors of the modern day keep the flame burning. See 
[3] for an in-depth analysis of this theory and its consequences. 
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Therefore, 


mi = IT Ox) = IK DIX) = |JOGx)/? =p (mod 4g), (6.24) 


where the last equality comes from Exercise 6.8 on page 276. Moreover, by Exercises 5.27 
on page 231, 5.52 on page 260, 6.17 on page 292, (with 6 = q), and Theorem A.25 on 
page 341, (which allows us to bring the gq inside the sum), 


G(x) = x*(MG(x%) = x(MG(x) (mod q). (6.25) 


Multiplying (6.25) through by G(x), and using Exercise 5.54, we get 


G™ (x) = G(x)x(G(X) = x(—1)px(q)_ (mod q). 


Hence, 
G™"(x) = x(-4@)p (mod q). (6.26) 
From Exercise 6.21 in conjunction with (6.24) and (6.26), we get 
x(—q)rtt! = y(-@)p = G(x) = (G2() (OY? 


(VDI (x,x)) SP? = (pI? (xx ))EtP/4 (mod q), 


and since J>(x,x ) = 7, by Lemma 6.5, then the last congruence becomes 


(eae PEN Gites \(mnod a) 
where the last congruence follows from (6.24). Therefore, we have shown that 
& = y(—¢) = rt 9G4D/4- FD) = “D4 = (=) (mod q). 
4 da) 4 
Thus (6.22) holds for a = —q = 1(mod 4). 


Now we assume that a = q where g = 1(mod 4) is a rational prime. By Exercise 5.52 and 
Theorem A.25, 


G(x) = X7G(x") = X(QG(x) (mod q). 


G**(x) = x(q) (mod q). 
Therefore, by Exercise 6.21, 
X(q) = G(x) = (G40) P4 = (PRR (xx) (mod gq). 
However, by Lemma 6.5, J?(x,x ) = 77. Thus, the last congruence becomes, 
(pn?)@-U/4 = (n3x)@-Y/4 (mod gq). 


Now let g = 77 in Z[t]. Then by Proposition 6.2 on page 278, the above congruence becomes 


Cree Cie ka. 


Hence, in particular, we have shown that 
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By taking complex conjugates, we get, 


©.=(),0,-(, on 


which establishes (6.22) for a = q = 1(mod 4), thereby securing Case 6.5. 


Case 6.6 a, © Z/i] are arbitrary primary integers with gcd(a,@) = 1. 
By Case 6.5 and the factorization property given in Definition 6.5, we may assume that 
gcd(a, b) = gcd(c, d) = 1. 


Claim 6.9 If a € Z, a= 1(mod 4) and @ € Z{i] is primary with gcd(8,a) = 1, then 


ele 


Given the factorization property in Definition 6.5, Claim 6.9 follows from Case 6.5 and 
Lemma 6.5. 


Let 
On = (-1)-D/2, 


where n € Z is odd. The reader may easily check that oga, o-c, and oga-(ac + bd) are 
primary. In the sequel, we will use the facts that 


Oe = (-1) YD? = (-1)¥? = #4, (6.27) 


and similarly, 


~~ 


O,=7°. (6.28) 
Since ca = ac+ bd(mod 8), then 


(2), (3),- (2), -(9) (252), om 


where the last equality follows from the fact that (%) 1 = 1. Also, by Claim 6.9, 


ee = B = e+ adi = ai _ a = j-ece)/2 
Boda Oc} 4 Gee Js Che OC) 4 , 


where the penultimate equality comes from Lemma 6.5 and Exercise 6.16 on page 292. 
Thus, from Case 6.5 and Claim 6.9, (6.29) becomes 


o Fa B “(aec—1)/2 
=|} = , 6.30 
(5), (3), (aaxerm),' \ 
By a similar argument to the above, 
B Te av (aa—1)/2 
a g : 6.31 
€ 4 e) Ta c(ac + bd) hs (6:31) 


By taking complex conjugates in (6.31), we get 


B _ (%e a -(l-oqa)/2 
(=) 7 ‘ey (soerta),' 


4 
since (2°), = +1 from (6.27). Now we multiply this last equation by (6.30) to get, 


5), 0.-C.G) Gata) 0 
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Claim 6.10 


(ar ($). (a a). = Gace 


First, we need to show that gcd(ad — bc, ac + bd) = 1 in order to invoke Exercise 6.16. If a 
prime 


D | gcd(ad — bc, ac + bd), 
then p is necessarily odd. Let p = a7 in Z[i]. Then since 7 | D, 
m | @B = (ac + bd) + (ad — be)i, 


and 


1 | aB = (ac + bd) — (ad — be)i. 
Since ged(a,@ ) = 1 = gced(a, 8), then 
T | q@ and 7 | a, 


or 


x| Band x | B. 


Without loss of generality, we may assume that the former is the case. Hence, 
m | a@ and 7 | (a+@) = 2a, 


SO 7 | a and 1m | b, given that p > 2. Hence p | gcd(a,b) = 1, a contradiction. Now we may 
invoke Exercise 6.16 to get 


fered _ factbd+(ad—be)i\ | (ad — bc)i - i 
Tatc(ac+bd)) 4 — Jadc(ac + bd) a \Gaoe(act+bd)), \oarc(ac+ bd) ) ,° 
Also, from Lemma 6.5 and (6.27)—(6.28), it follows that 


: Z NL. 2X08 
(2) (=) = (“) @ _ (+) (5) = j(1-a)d/2;(1—0)b/2 _ jbd/2jba/2 
a/a\ B 4 al 4 B 4 as 4 B 4 


since a and @ are primary. This completes the proof of Claim 6.10. 
By Claim 6.10, (6.32) becomes 


a 6 = ae Se jloce-eaa)/2 = ja. Ja0c(ac+bd)+occ Faa)/2 
Bla\a/y Oao-(ac + bd) } , 


However, by definition o.c = a,a = 1(mod 4), so the latter equals 


p77 aeebd/2 = (Ad jrserebas pe ayes, 


This establishes (6.21) in general. 
An application of the Biquadratic Reciprocity Law is the following. 


Theorem 6.6 — Quartic Reciprocity and Prime Representation 

Suppose that p = 1(mod 4) is a rational prime with a = (—1)°-)/4 (mod 4), where p = 
a? + b?. Also, let b/2 = 1(mod 4) if p = 5(mod 8).®!4 Then 6 is a quartic residue modulo 
p if p= 1(mod 8), and b/2 is a quartic residue modulo p if p = 5(mod 8). 


6.14These choices of a and b are made without loss of generality since one of ta and one of +b/2 must 
satisfy the congruences. 
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Proof. If p = 1(mod 8), then a + bi = 7 is primary since a = 1(mod 4) and b = 0 
(mod 4). Therefore, by the Biquadratic Reciprocity Law, Proposition 6.3 and Exercise 6.16 
on page 292, 


.-(9,-O.0.-O.0.-C2)0 
= Che = e) Gia. = = =. 


Hence, by Exercise 6.22, b is a quartic residue modulo p. 
If p = 5(mod 8), then since —a = b/2 = 1(mod 4), both = a+ bi and 6/2 are primary. 
Therefore, by the Biquadratic Reciprocity Law, and Exercise 6.16, 


(o).° Ga),> Gra)" 


Thus, by Exercise 6.22, b/2 is a quartic residue modulo p. 


In Theorem 6.4 on page 273, we gave the Supplement to the Cubic Reciprocity Law. Now 
we look at the exceptional element 1+ 7% in the quartic case. The following result is due to 
Eisenstein, and the ideas in the proof are due to K. S. Williams [72]. 


Theorem 6.7 — Supplement to the Biquadratic Reciprocity Law 
Let a =a-+ bi € Zii] be primary. Then 


a j(a—b-1-b?)/4 
Gr J} 4 : 


Proof. First we establish the result for the case b = 0. 


Claim 6.11 If a=a€ Z, where a = 1(mod 4), then 


(=) _ o-/4 
a /4 


It suffices to prove the claim for a = +p where p is a rational prime. To see this, assume 
that we have rational integers a; = a2 = 1(mod 4). Then 


(ay — 1)/44+ (a2 —1)/4 = (aiaz —1)/4 (mod 4). (6.33) 


We first assume that a = p = 1(mod 4) is a rational prime, so p = 77 in Zi]. Then 


(14) =(428) (29) = (29) (9,059. a 


where the last equality follows from the fact that 1+7%=i(1—7). Thus, (6.34) becomes, 


a 1g \ (fT = ui — j(Nrr)—-1)/4 _ j(p-1)/4 
T 4 Tv 4 T 4 7 4 . 


Now we may assume that a = —p = 1(mod 4) where p is a rational prime. By the Binomial 
Theorem, 


a+r= > (lard =148=1-3 (mod p), 
j=0 
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since p = 3(mod 4) and i? = —i. Therefore, 


(- LE "| =(1+ j)@?-v/4 =((1+ i)P—1)(p+1)/4 = j(-p-1)/4 (mod p), 
4 


which establishes Claim 6.11. 


By (6.33) and Claim 6.11, we may assume that a + bi is primary with gcd(a,b) = 1. Set 
a* = (—1)*/?a = 1(mod 4). Then by Exercise 6.16 on page 292, Lemma 6.5 on page 280 
and part (d) of Proposition 6.3, 


1+i\ —(a\ "(bi Ltt) _ er-aye (abi L+i 
at bi ran Oy 5 a*],\a+bi ae Gf fg Na Og 


b/2 


However, since a* = a(—1)°/? = ai®, then by the Biquadratic Reciprocity Law, the previous 


equation equals 


a . “b % 
jat-/2 a ba _ ;(a*-1)/2 ( _% a 
at+bi/,\at+bi/, at+bi/,\at+bi/, 


“ b . : 
= (e*-1)/2 7 eV ga gla 1) gb) (2 | SE 
a+bi/, \at+bi/, at bi), 


and since a+ ai = a+ bi +i(a—b), then by Lemma 6.5 this equals, 


j(a” —14+b(1—a)) /2 i(a — b) — j(a*-1+b(1—a))/2;(1—a)/2 (a—b) 
at+bi /, a+bi/, 


j(a*—atb(1-a))/2 (a—b)\ _ j30?/4 (a — b) 
a+bi/, a+bi },’ 


where the last equality follows from the fact that 


(a* — a)/2+b(1— a)/2 = 67/4 + b?/2 = 3b?/4 (mod 4). 


Since a — b= 1(mod 4) is primary, then by the Biquadratic Reciprocity Law, 


j30?/4 (a — b) _ ;382/4 (a + bt) _ ,-e/4 (a—b+b+ di) 
a+bi /4 a—b }, a—b A 


_ 0/4 (b+ bi) = je /4 (1+ 7%) 
a—b }, a—b/,’ 


where the last equality follows from Exercise 6.16 on page 292. From Claim 6.11, this equals 


707 /4;(a-b-1)/4 j(a—b-1- 8°) /4 


which completes the proof. 


An application of Theorem 6.7 on the preceding page is the following, which also is con- 
sidered to be one of the supplementary laws for biquadratic reciprocity—see Lemma 6.5 on 
page 280. 
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Theorem 6.8 — The Quartic Nature of 2 
If tr =a+bi € Zi] is a primary prime, then 


Proof. Since 2 = i3(1+7)?, then 


2),-(2,(E), 


so by Theorem 6.7 on page 286 and Lemma 6.5 on page 280, 


TT 


2 
(2) = j3(1—a)/2;(a—b-1—b*) /2 = j(2-2a—b—b?) /2 
4 


Since a + b= 1(mod 4), then 


2—2a-—b—b? = 2(1-—a) -—b-? = 2b—b-—& =b(1—b) =ab (mod 4), 


from which the result follows. 


The following consequence of Theorem 6.8 was conjectured by Euler and proved by Gauss 
as a consequence of his work on biquadratic reciprocity. 


Corollary 6.2 Let p= 1(mod 4) be a rational prime. Then there exist x,y € Z such that 
p = x? + 64y? if and only if 2 = z*(mod p) for some z € Z. In other words, if p splits in 
Q(i), then p = x? + 64y? for some x, y € Z if and only if 2 is biquadratic residue modulo p. 


Proof. Since p = 1(mod 4), then p = a? +b? = x7, where t = a+bi € Z/i] is primary since 
we may choose a to be odd and b to be even. By Theorem 6.8, Exercise 6.22 on page 293, 
and part (b) of Proposition 6.3 on page 279, 2 = x*(mod p) if and only if 8 | b. In other 
words, p = x” + 64y? for some 2, y € Z if and only if 2 is a quartic residue modulo p. 


We continue with a result from the realm of rational biquadratic reciprocity. This refers to 
those quartic residue symbols which assume only values +1 —see Exercises 6.16—-6.18 on 
page 292. In particular, if p = q = 1(mod 4) are primes such that p is a square modulo 
q, then by Exercise 6.17, (4), = 1 or —1 depending upon whether q is a quartic residue 
modulo p or not. Thus, (4), depends only upon p and q and not upon z.°! This naturally 
leads us to ask for the relationship between (4), and (4)4 where q = pp in Z[i]. In 1969, 
K. Burde [8] discovered the following elegant answer. 


Theorem 6.9 — Burde’s Rational Quartic Reciprocity Law 
Let p = q = 1(mod 4) be rational primes with Legendre symbol es = 1, and set p= 77, 
q = pp where r =a+bi,op =c+di € Z[i], are primary.°!® Then 


(2), e) z (tht) = (yee (i) 2 (weit), 


6-15The reader is cautioned that, for the above reasons, it is common practice in the literature, to use the 
symbol (P)4 for (P)a- However, Exercise 6.16 tells us that (2)4=1, 80 when (P)4 is used as a rational residue 
symbol, it takes on a different meaning from that established in Definition 6.5, so a caveat has to be given 
to that effect—see [3, p. 252], for instance. For the sake of clarity, especially for the “browsing” reader, we 
break with convention and avoid such notation, which is unnecessary in view of Exercise 6.22. 

6.16 We lose no generality by assuming primary 7 and p here. See Claim 6.14. 
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Proof. First, we establish the following. 


Claim 6.12 (%) =(t) 


Using Proposition 6.3 on page 279 and the Biquadratic Reciprocity Law, we get 
(4) = (2) = (2) (2) = (=) (=) = (=) = (=) 
Wes Tara WANT) 4 Pla\P/a PP) 4 q) 4 
which secures Claim 6.12. 


Claim 6.13 (2), (), = 7(9-1)/2 (mod p). 


TT 


By Claim 6.12, and Proposition 6.3, 


OED): 
TA NPS 4 G/4\P/a PJa\PJa\P/Ja\P/a Pra 
However, by Proposition 6.2 on page 278, 

2 2 
(<) = (aNe)-DI4) = gD (mod 

4 


which yields Claim 6.13. 
By the Quadratic Reciprocity Law, 


coon (2) = (1) = (a) = (a) = (ala? mot. 


Thus, from Claim 6.13, 


(4) (2) = (ac + bei) /? = (ac + bd + W(e + di)i)-Y/? 
4 4 


ate) (mod p), 
q 


from which the first equality in the statement of the theorem follows, since the latter 


congruence also holds modulo / given that both sides of the congruence are +1. To get the 
ad+bec 
q 


= (ac + bd)G-V? = ( 


last two equalities, we need the following, which establishes that ( ) is independent of 


sign. 
Claim 6.14 (t-te) = (set), 


Since q = c? + d?, then 


(=~) (== 7 (<< —7e) - (<* + b?d? — b?d? =*e) 

qd qd qd qd 

7 (A es) _ (He) _ (<*) 7 (2) =4 
q q q q , 


This completes the proof of Claim 6.14. 
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From Claim 6.14 we have, 
(==) (=) = (=e) (== = Gas _ (=) 
eal 


— (_1)(@=1)/8 — ((¢1)(a+1)/2)(q-1)/4 — (_1)(q-1)/4 
(=i) (1) ) (=I) ; 


from which the penultimate equality of the theorem follows. The last equality in the state- 
ment of the theorem follows by symmetry. 


Given distinct primes p = gq = 1(mod 4), such that p is a quartic residue modulo q, Burde’s 
Theorem gives necessary and sufficient conditions for gq to be a quartic residue modulo p. 
For instance, we have the following illustration. 


Example 6.5 Let p = 29 and q = 181. Here we may take a = 5, b= 2, c=9 and d= 10. 
Since 64 = 29(mod 181), then (2%), = 1, where g = c? + d? = pp. Therefore, by Burde’s 
Theorem with p = a? + b? = a7, 


(8) 
7 Ge (ia) 7 @ ae 7 (is) 7 (is) 7 © i 


by (A.10) on page 342. Hence, 181 is not a fourth power modulo 29. However, (3) = 
y (b 


(+) = 1, so 181 is a quadratic residue modulo 29. This places 29 in categor ) of 


Exercise 6.18 on page 292. 


As valuable as rational biquadratic reciprocity has shown to be, it has its limitations in the 
greater scope of applications of biquadratic reciprocity. For instance, if n > 3 is an odd 
integer and p = 4n + 1 is prime, then 


N=P? 41 = 07 = 9°)? £71)" +o? 14) AB, 
is not prime. The Quadratic Reciprocity Law tells us that 


2 
P 


227 4.4 = Qe-V/? 4 1=( ) +1=0 (mod p), 


since p = 5(mod 8) by (A.10). Thus, p | N. The question naturally arises (and was posed 
by Brillhart in [6]): Which of A or B does p divide? To answer the question, we need to know 
A modulo p. Since 2” = 2/4 and (2) = —1, we need to determine which of 2°°-))/4 = j 
(mod 7), or 2@-))/4 = —i(mod z) holds, where p = 17 with 7,7 € Z[i]. In other words, 
we are in category (a) of Exercise 6.18. Rational quartic reciprocity does not help us here. 
Instead, we close this section with a demonstration of how the Biquadratic Reciprocity 
Laws and its supplements may be used to answer the above question. The following was 
first proved by Gosset [22] in 1910, but the following proof is due to Lemmermeyer [38]. 
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Theorem 6.10 Quadratic Reciprocity and Factorization 
Let p = 4n +1 =a? + 4b? be a prime where n € N is odd. Then 


2°77 4.1 = AB, 
where 
Aa aor? a 7) Bao? QO)? =f 
b=+3 (mod 8) if and only if p | A and B = 2(1+2") (mod p), 
and 
b= +1 (mod 8) if and only if p | Band A= 2(1+ 2") (mod p). 


Proof. First, set 7 = a+ 2bi, where we may assume that a = 3(mod 4) and 26 = 2(mod 4) 
without loss of generality since one of +7 must satisfy the congruences. As shown in the 
preamble to this theorem, p | (2?” +1). Thus, by Theorem 6.8 on page 288, 


T 


g(p—1)/4 — (2) =i =i? (mod 7). (6.35) 
4 


Also, 2(?+3)/8 — ((1 + i)?i71)(+8)/8 — (1 + i) +3)/4;-(+8)/8 | from which it follows that 


g(P+9)/8(1 4 4) = (1 4 O-D/4j-OH9)/8 = (=) j-(p+3)/8 
TT 
4 


~ j(a—2b— 1-40") /4,,—(p+3)/8 = j(2a—4b—a? — 126? —5)/8 (mod nm), 


where the penultimate congruence follows from Theorem 6.7 on page 286. A calculation 


shows that ba 92 j 
oD, 4 12, 
a! 5 2 = Z ; (mod 4). 


Thus, we have shown that 
2(P+3)/8 = (1 + 4)i8-9)/2 (mod 7). (6.36) 


We now use (6.35)—(6.36) in each of the following cases. 
If b = 1(mod 8), then 


B= Q?-)/4 4 9@t9)/8 4 pg 4 GOOG 44) + bSP +74+74120 (mod 7). 


By taking complex conjugates, i + 7? + i? + 1=0(mod 7), so B= 0(mod p). 
If b = —1(mod 8), then 


Be Ae Pg ea Sg eee SO od), 


so as in the previous case p | B. 
If b = 3(mod 8), then 


A=2?-D/ _ 9019/8 4441507 -18 972 040941=1-(40941=0 (mod a), 


so as above, p | A. 
If b = —3(mod 8), then 


A=i?-i8 97041 41=8-8(141)4+1=0 (mod n), 


and as above A = 0(mod p). 


Since A+ B= 2+ 2"+!, then the remaining congruences follow. 
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Exercises 


6.16. 


Let a,b € Z with b odd, a nonzero and gcd(a, b) = 1. Prove that 


6.17. Let a € Z be nonzero, and let p = 1(mod 4) be a rational prime with p = 77 where 


6.18. 


m is a prime element of Z[i]. Prove that 


where the symbol on the right is the Legendre symbol. In particular, conclude that 


(*) = +1 if and only if (<) = 
w/a D 


With the same hypothesis as that of Exercise 6.17, prove that 


(san) = Gz) 


and that this multiplicative group is partitioned into three parts as follows: 


(a) The biquadratic residues, namely those for which (2), = 1. 

(b) The quadratic residues that are not biquadratic residues, namely those for which 
(est. 

(c) The quadratic nonresidues, namely those for which (2)4 = +7. 


The facts established in Exercises 6.16-6.18 are aspects of what is called rational biquadratic 
reciprocity. See the preamble to Theorem 6.9. 


6.19. 


6.20. 


6.21. 


Prove that a Gaussian integer, which is not a unit, is primary if and only if it can be 
factored into a product of primary Gaussian primes. 


Let a@ € Zi] = Or be a nonunit with (1+ 72) { a. Prove that there exists a unique 
unit u € Uo, such that ua is primary. In particular, conclude (via Remark 6.5 on 
page 280) that if a is primary, then u = 1. 

Let x = yo = (=)4 where p = 1(mod 4) is a rational prime with p = 17 in Z/i], and 
m primary. Prove that 


G(x) = Tax VP = (IPP yp. 


(The fact that G?(x) = (—1)°*%)/4x,/p has some historical interest. It implies that 


G(x) = o4/(—1)+9)/42)/p, where o = +1 and the square root has positive real part. 
In 1979, Matthews [45] proved that 


1--(2), (8) 


where the symbol on the right is the Jacobi symbol and 8 = +1 is defined by B = (2)! 
(mod 7).) 


6.2. 


6.22. 


6.23. 


6.24. 


6.25. 


6.26. 


6.27. 
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Suppose that p = 1(mod 4) is a rational prime with p = a7 in Z[i]. Prove that a 
given 6 € Z/i] is a quartic residue modulo p if and only if it is a quartic residue modulo 
both a and 7. Furthermore, prove that 6 is a quartic residue modulo p if and only if 
5 = z*(mod p) for some z € Z. 

Let p = 3(mod 4) be a rational prime, and a € Z with p { a. Prove that a is a 
biquadratic residue modulo p. Furthermore, establish that there exists an x € Z such 
that a = 2*(mod p) if and only if (4) = 1, where the latter is the Legendre symbol. 


Let p > 2 be a rational prime. Prove that if 


2=-x* (mod p) for some x € Zi], then p= +1 (mod 8). 


Let p = 1(mod 8) be a rational prime, so p = a? + 2b? for some a,b € Z—see 
Corollaries 4.1—-4.2 on page 141. Also, let p = 17 for 7,7 € Zi]. Prove that 


where the right-hand symbol is the Legendre symbol. 
This fact that for a prime p = 1(mod 8), the biquadratic character of 2 is determined 
by its decomposition p = a? + 2b7, was first proved by Gauss. 


Let p = a2 + &? and q = c? +d? be distinct primes with b = d = 0(mod 2). Use 
Theorem 6.9 on page 288 to prove that 


(= *) 7 (= 

p ae as 
Let p = a? +b? = 4m+1 be a rational prime where 7 = a + bi € Z[i] is primary. 
Establish each of the following. 


(a) Ifx= y, then J(y,x?) =7. 
(b) 2a = (-1)™(7”") (mod p). 


m 


The result in part (b) was first proved by Gauss in 1828. A more involved result in 
this direction was found by Cauchy, namely if 


p= 20m4+1 =u? +5v? 


eg Gh) =4u? (mod p). 


Numerous results involving congruences and binomial coefficients have been found over 
the last century see [3, Chapter 9, pp. 268-293] for details. 


is prime, then 
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6.3. The Stickelberger Relation 


Personal relations are the important thing for ever and ever, and not this outer life 
of telegrams and anger. 


from Chapter 19 of Howard’s End (1910) 
E.M. Forster 
English Novelist 


In 86.4, we will prove the Eisenstein Reciprocity Law, which generalizes the laws studied 
thus far in this chapter. However, in order to do so, we need to develop some notions 
surrounding the concept in the title, which is the primary object of study in this section—see 
Theorem 6.12 on page 302. First we need the following generalization of ideas developed in 
the preceding two sections. 


Proposition 6.4 — Power Residue Congruences 


Suppose that F' = Q(¢,,) where n € N and p is a prime D y-ideal with N(p) = ¢ = 1(mod n). 
If a € DOr and a ¢ p, then there exists a 7 € Z, unique modulo n, such that 


alt-Y/" = ¢F (mod p). 


Proof. Since |(Or/p)*| = q—1, then a?! = 1(mod p). Therefore, since N(p) = q = 
1(mod n), then a%—-)/" is a root of x” = 1(mod p), as are the distinct values ¢/ for 
j = 0,1,...,n—1. Thus, a%—)/" must be (uniquely) one of them by Exercise 4.29 on 
page 163. 


Definition 6.7 — Power Residue Symbol 


Let F = Q(¢,), where n > 1 is an integer, a € Dp, and p is a prime D-y-ideal with n ¢ p. 
Then the n“” power residue symbol is defined to be 


(*) = q\N()-)/" = (J (mod p), when a ¢ p, 


where j is the unique integer given in Proposition 6.4, and 


a 


—} =0, when a€ p. 
G), 


If J is an Dp-ideal and 


is the prime factorization of J given by Theorem 1.17 on page 28, where the P; are not 
necessarily distinct, and gcd((n), I) = 1, then 


and if gcd(a,6) = 1, then 
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Proposition 6.5 — Properties of the Power Residue Symbol 
Suppose that F = Q(¢,), a,8 € Or and I,J are D;p-ideals relatively prime to n. Then 


(b) (Fy)n = (F) nl Fa 


(c) Ifa is prime to J and £” = a(mod J) is solvable for some x € Of, then (¢),, = 1. 


Proof. Part (c) follows from Exercise 6.1 on page 275, and parts (a)—(b) follow directly from 
Definition 6.7. 


Now we bring Galois theory into the picture. 


Proposition 6.6 — Galois Action on Residue Symbols 
Let F = Q(¢,), and let I be an Op-ideal with gcd(J, (n)) = 1. If o € Gal(F/Q), then 


eae 

Phi NIG 

Proof. Given property (b) of Proposition 6.5, it suffices to prove this for the case where 
I =p is a prime 9 y-ideal. By Definition 6.7 and the fact that N(p) = N(p7), 


(=) = (a7)NP)=D/n = (gQeyNW)=V/n = (g(N@)=D/nyo = (=) his. 


which is the desired result. 


Now we bring Gauss sums into the picture. For a reminder of the definition of Gauss sums, 
see Exercise 5.52 on page 260. 


Definition 6.8 — Power Residue Characters 


Let F = Q(¢,) where n € N and p is a prime 9 -ideal such that n ¢ p. Suppose further 
that N(p) = q = p’, p is a rational prime and f = fr/o(p), where p! = 1(mod n)—see 
Corollary 5.13 on page 218. If y(a@) = a is the image of a under the natural map W : Dp + 
Or/p, then for (a) = a £ 0, define a character x”) on Fy = Or/p by 


The reason for the choice of the inverse in Definition 6.8 will become evident in the proof 
of Theorem 6.11 on page 298. On the basis of Definition 6.8 and the definition of Gauss 
sums given in Exercise 5.52, we introduce the following link to Gauss sums. 


Definition 6.9 — Gauss Sums and Power Residues 


With the assumptions of Definition 6.8, we let x = xf”). Then we define 


Glp) = Gx) = GQ) = SY x(a)gp'e”?™, (6.37) 
weF, 
and 
(p) = G(p)”, (6.38) 


The &(p) were studied by Jacobi in 1827 for the case where q = p. 
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Proposition 6.7 — Properties of Power Residue Gauss Sums 
With the assumptions of Definition 6.8, each of the following holds. 


(a) G(p) € QCnp)- 
(b) |G(p)P? =¢. 
(c) G(p) € Qn). 


Proof. By Equation (6.37) on page 295, and the fact that the values of xe are n'” roots 
of unity, with p{n, then (a) follows—see Exercise 5.27 on page 231. Part (b) follows from 
Exercise 5.54 on page 260. Part (c) follows from Claim 6.1 on page 265 since the proof of 
that claim extends to Fy. 


In order to establish the Stickelberger Relation, we need to understand the decomposition 
of primes above p in various cyclotomic extensions—see Biography 1.4 on page 54. The 
following development is toward that goal. We first remind the reader of the notion of the 


order of an ideal modulo a prime ideal introduced in Exercise 1.44 on page 34, denoted by 
ord, (I). 


Proposition 6.8 — Properties of ord, (I) 
The integer ord,(JZ) satisfies each of the following. 

(a) Ifq is a prime Op-ideal, then ord,(q) = 0 if p # q, and ordy(q) = 1 if p= q. 
(b) IfJ and J are Op-ideals, then ord, (IJ) = ordy (I) + ord, (J). 


(c) Suppose that I is an DO ;-ideal and 
r=JT]p™. 
p 


where the product is taken over all distinct prime D-ideals p, is the unique factor- 
ization given by Theorem 1.17 on page 28 with a(p) 4 0 for only finitely many such 
integers. Then a(p) is that unique nonnegative integer given by 


a(p) = ord, (JZ). 


Proof. Since p Z p? and since ord,(q) > 0 if and only if q C p, then part (a) follows. Part 
(b) is part (a) of Exercise 1.44. Part (c) follows from Theorem 1.17 on page 28. 


Diagram 6.1 
PC DOK —> OK/P 


BC 0, ——> 0,/P 


p © Z—+ Z/p@ 
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Diagram 6.1 will be a visual aid for the reader in the proof of the next result. 


We now return to a consideration of the decomposition of p in cyclotomic fields discussed 
above. The following sets the stage for the Stickelberger Relation, since what is at the heart 
of this relation is prime ideal decomposition in cyclotomic fields, given that Gauss sums are 
in such fields. The reader should be familiar with the results surrounding Corollary 5.13 on 
page 218 before proceeding. 


Let n € N, F = Q(¢,), p a rational prime such that p { n, and p a prime O;-ideal 
above p. Furthermore, let K = Q(¢,(q-1)), L = Q(¢q-1), where q = pf = 1(mod n), and 
f = frjo(p). Also let P be a prime Ox-ideal above p, and set PAL = P. 


Proposition 6.9 — Order of Ideals in Cyclotomic Fields 
With notation as in the above preamble, each of the following holds. 


(a) orde(pOx) =p-1. 
(b) ordp(1—G) =1. 
(c) orde(p) = p—1. 
(4) Or/p ¥Ox/R. 


Proof. Part (a) is an immediate consequence of Corollary 5.13. Also, from Example 5.8 on 
page 190, we see that 


POP = pOge,) = (1— Gp)? *Dag,) = (1- G)P TF, 


so 
p-l 


g 
POK = POQ(¢,) OK = (1— Cp)? OK = | [TP 
j=1 


where P; = P say, and g = gx/g(p). Thus, (1—¢,)Ox = a P;, and (b) follows. Also, 


from Corollary 5.18, 
p-l 


g g 
pl [pdx = {P][% 
j=2 j=2 
Therefore, since gcd(P, P;) = 1 for 7 > 1, 
pOK ar ‘ame 


from which (c) follows. 
Lastly, we establish part (d). By Corollary 5.13, 


fro(p) = |Dx/® : Z/(p)| 


is the smallest natural number such that p!+/e) = 1(mod q — 1). However, q = p‘, so 
p! =1(mod q—1), and f = f,/9(p), so by Theorem 5.1 on page 184, 


f = fro) = frye) fre(p) = fre (p) Ff. 


Thus, 
1= frjp(p) =|01/B: Dr/pl, 


from which (d) follows via Definition 5.1 on page 182. 
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Remark 6.6 Part (d) of Proposition 6.9 on the preceding page allows us to define a 
character for on F; = 0,/% = Or/p, as a (gq — 1)-th power residue symbol, namely for 


VET; 
won =(F) 


where w is the natural map ~ : O; +> Fy. Thus, a has order gq — 1, so it generates 


¢h(Fz). This allows us to introduce another important Gauss sum. 


Definition 6.10 — Gauss sums on F, 


With the setup given in Diagram 6.1 on page 296, and m € N, set 
1 m 
Gm (B) = Gx? )-). 


Remark 6.7 Notice that ifm = (q—1)/n in Definition 6.10, then G, (8) = G(p) given in 
(6.37) on page 295, since for any a € Op we have, 


ee 7 (*) 

2p q—-1 p n , 

The following result will give us the necessary machinery to prove the desired Stickelberger 
Relation. 


Theorem 6.11 — Orders of Gauss Sums on F, 


Given the setup in Diagram 6.1, and me N withO<m<q, 


f-1 
orde(Gm(B)) = So al”, (6.39) 
j=0 
where the ay are defined by 
f-l 
m= al”) pi (6.40) 
j=0 


which is the unique representation of m to base p with 0 < ae 


Proof. If q = 2, then 


Gi(B) = Gi(p) = G1) = Vp 
by Exercise 5.34 on page 232. Therefore, ordp(Gi(%8)) = 1. We may now assume that 
q > 2. 


First, we note that it is a fact from elementary number theory that any integer has a unique 
representation as given in (6.40)—for instance see [53, Theorem 1.5, p. 8], known as the 
Base Representation Theorem. To establish (6.39), we first consider the case m = 1. Let 


Ap = 1—G and set x = en Then 


q-1 q-1 


- are Fp (J) Fg /Fp (J 
GR SOR OG ot Se Ap) Fat), 


j=0 j=0 
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We may let n; € N such that 


ny = Te, ye, (W(Gj-1)) (mod p), 


where w is given in Definition 6.8 on page 295, so since 


x (W(G_-1)) = (+) = C2 


q-1 
then 
q-2 ; 
Gi(P) = S02 (1 = Ap) al), 
j=0 


Furthermore, since the Binomial Theorem tells us that 
(1—Ap)™ =1—njAp (mod P?) 
via Example 5.8 on page 190, given that q > 2, and since 
= 
ny = Tepe, (W(G_-1)) = Y G21 (mod p), 
k=0 


by the definition of relative trace in finite fields—see Exercise 5.52 on page 260—then it 
follows that 


—2 f-lq-2 
GP) = G2 (1% a) =» Ge) =-r(a- 1) (mod F*), 
j=0 k=0 j=0 


where the last two congruences follow from Exercise 6.28 on page 310, since 


on 0 fork =1,2,...,f—1, 
q-1 ifk=0. 


Thus, since g = pf = 0(mod ??), then 
Gi(B) =A» (mod P?). 


Since ,, € P—P? by part (b) of Proposition 6.9 on page 297, then ordp(Gi()) = 1, which 
completes the proof for m = 1. 


Claim 6.15 If 1<m,n,m+n<q-1, then 
orde(Gmin(B)) < orde(Gin(B)) + orde(Gn(P)). 
By part (a) of Lemma 6.2 on page 264, 
Gm(B)Gn(B) = Ja(xX—")Gm+n(P)- (6.41) 
Thus, by part (b) of Proposition 6.8 on page 296 
orde(Gim(B)) + orde(Gn(P)) 


= orde(Jg(x~™, x~")) + orde(Gmin(P)) = orde ((Gmin(P)), 
and Claim 6.15 follows. 
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Claim 6.16 ordp(Gnin(B)) = ordp(Gin()) + ordy (Gn (P)) (mod p— 1). 


By Corollary 5.13 on page 218, 
BO rw = perk. 


and by Exercise 5.27 on page 231, 
Jg(x”™,x—") € L, 


so 
(p—1) | orde(Jq(x~™5x~"))s 
from which we get Claim 6.16, via (6.41). 


Claim 6.17 For m €N, ordp(Gpm()) = orde(Gn(B)). 


Since 
oz te cGy Tr, 2, (3?) 
CB) Soe KU) OG tO =O) Ge. = GAD), 
j=0 j= 


since j + j? is an automorphism of F,, and Ty, /g, (3) = Tr, /r,(J?). Claim 6.17 follows. 


Claim 6.18 If 1<m <q, then 


By Claims 6.15, 6.16 and the already proved fact that ordp(G1(%8)) = 1, we get 


ordp(Ga(B)) =a 


for 1<a<_p. Thus, using Claims 6.15 and 6.17, 


f-1 f-1 f-1 
orde(Gm(B)) < 5° ord (Gm); (B)) = S ordp(G om (PB) = S> al™, 
j=0 , j=0 : j=0 
which is Claim 6.18. 
Claim 6.19 574? ordp(Gn(B)) = LEVE) 
By Exercise 5.54 on page 260, 
Gin (B)Gq—1-m (PB) = x(-1)™¢ = x(-1)™p", (6.42) 
since ess 
Gq-1-m(P) = G((xge”)™)- (6.43) 


By taking ord» of both sides of (6.42), and using part (c) of Proposition 6.9 on page 297, 
we get, 
orde(Gm(P)) + orde(Gq-1-m(B)) = f(p — 1). 
Thus, 
q-2 


S7 orde(Gn(B)) + So ordy(Gy_-1-m(B)) = F-V(a-2). 


m=1 
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However, by (6.43), 


S— orde(G@m(P)) = > ordp(Gq—1—-m(P)), 


so Claim 6.19 follows. 


Claim 6.20 >>?) oe al) — — fer?) 


Since g-1= eS (p — 1)p’ by Theorem B.4 on page 347, then by (6.40) on page 298, 


f-l 
g-1-m=)> (p-1-a™) pi 
j=0 


This shows that a =p-1- a” for 7 = 0,1,...,f—1. Therefore, 


f-l 


f-1 
ay + ah” = fp). 
j=0 j=0 


Thus, 
q-2 f-1 


f-1 
af” + Soa ™ | = f(p-1)@- 2). 
j=0 


m=1 \j=0 


However, an easy check shows that 


so Claim 6.20 is established. 
The main result now follows from Claims 6.18-6.20. 


Corollary 6.3 ord,(G(p)) = no ; alle 1)/n)_ 
Proof. By part (c) of Proposition 6.9, 


(p— 1) ordp(G(p)) = orde(G(p)), (6.44) 


but by Theorem 6.11, and the fact, from Remark 6.7 on page 298, that G(q_—1)/n(B) = G(p), 


f 
ny al@Y/) = n- orde(Giq—1)/n(B)) = ord (G(q—1)/n B)”) = orde(G(p)), 


so from (6.44) we get the result. 


We are now in a position to state and prove the following, first proved by Stickelberger in 
1890. The special case where n is a prime and p = 1(mod n) was first proved by Kummer 
in 1847. 
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Theorem 6.12 — The Stickelberger Relation 


Suppose that F = Q(¢,) where n € N, n > 1, and p is a prime O--ideal with n ¢ p. Then 
in Op we have the ideal decomposition, 


(G(p)) = pret’, 


where the sum runs over all natural numbers ¢ < n with gcd(t,n) = 1, and o € G = 
Gal(F'/Q) is given by 04: Gn Cl. 


Proof. By part (b) of Proposition 6.7 on page 296, 
IS(p)? =a =p", 


so the only prime 9 y-ideals dividing 6(p) are those dividing p. Let p; be a prime 9 -ideal 
above p. Then, by Corollary 5.1 on page 190, there exists a 


o4 € Gal(Q(¢n)/Q 


such that 


pr’ =P. 
Thus, for natural numbers t < n, relatively prime to n, we define 
pe=p™. (6.45) 
Claim 6.21 ord, (6(p)) = 52; D4zp af. 
From (6.45), we have, 
ordp, (G(p)) = ord ,-1 (G(p)) = ordy(G(p)”*). (6.46) 


Let z € Z such that z = t(mod n) and z = 1(mod p). Then 


> yy Te, jr, (2) 
Gn = (s Xp (a) aon ean. 
x=0 20 


since ¢5* = Cp by the choice of z. Therefore, 


n 


B(p)” = ( (x$(@)) Gum = (Gravina ®))”, 


x=0 


where the last equality follows from the fact that 


(x)! - Cae, 


via Definition 6.10 on page 298. Thus, by Theorem 6.11, 


f-1 
‘a S- gece) =F ord (Gi(q—1)/n(B)) = ord (Giqg—1)/nB)") = 
j=0 


= ord(G(p)*") = (p — 1) ordp(G(p)**) = (p — 1) ord, (6(p)), 
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where the penultimate equality follows from part (c) of Proposition 6.9 on page 297, and 
the last equality comes from (6.46). We have established Claim 6.21. 


Given t € N, t <n and gced(t,n) = 1, we let t; be defined by 
t=tp’ (mod n), 
for some unique pair (7,7) with 0 < j < f and 1 <i < g, where g is the number of cosets 


of Uzjnz/(w(p)). Thus, t1,t2,...,tj are the rational integer representatives of those cosets. 
Claim 6.21 tells us that 


where 


ya a -D/9) | got, 
t=1 


Claim 6.22 If {x} = x — |x], called the fractional part of the real number x, and || is 
the floor function, then 


roa y (BE) 


For simplicity set 


with s = (t;(q¢—1)/n). 
For i > 0, we let f —7+ 4 denote the residue class modulo f of the integer f —i+ 7. Then 


pm= yl say (mod q-1). 


Since 
= 


Yl ap 5 am <q-1 
for all such 7, then 


pim al) 
{2"}- Ake F=)? 
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which yields Claim 6.22. 
However, p?? = p, by Corollary 5.13 on page 218. In other words, 


(op) = Dp(F/Q) 


—see Application 5.3 on page 231. Therefore, r may be replaced by 


is a UE oy ot =ny {2 hort = Sur (6.47) 


w=1 j=0 


where the last sum runs over all natural numbers ¢ < n and relatively prime to n, and the 
last equality follows from the fact that 


We have shown that 23 
(G(p)) = pret , 


where the sum runs over all natural numbers t < n relatively prime to n, which is the 
Stickelberger Relation. 


The proof of the Stickelberger Relation provides us with a distinguished element that we 
will be able to use in 86.4. 


Definition 6.11 — The Stickelberger Element and Ideal 
With notation as in Theorem 6.12, 


t 
d= a ae 
tml 
is called the Stickelberger Element. The Stickelberger Ideal is 
I(F) = Z[G) nN éZ|GI, 


which are the Z[G]-multiples of 0 that have coefficients in Z. 


Remark 6.8 In view of Definition 6.11, Equation (6.47) in the proof of the Stickelberger 
Relation tells us that 


(G(p)) =p”’. 
Also observe that 
6 € Q\G], 
where 
G = Gal(F/Q). 


See Exercise 5.48 on page 253 for the general definition of a group ring. 


The following three examples illustrate Theorem 6.12 for small values of n. 
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Example 6.6 If n = 2, then Q(¢2) = Q, and p = (p) where p > 2 is a rational prime such 
that 
Gp)? = (-1)”-YPp, 


This is the trivial case with 
G = Gal(F/Q) = 1. 


The Stickelberger Relation does not precisely say this, but we know that this holds by 
Exercise 5.34 on page 232. 


Example 6.7 Suppose that n = 3, f = 1, and p= 1(mod 3). Then p = (z), where z is a 
primary element of Or. Thus, by Claim 6.1 on page 265 and Exercise 6.8 on page 276, 


G(p)? = pa = at? = 71 *?2e, 


where 


(Jp) = Gal(F'/Q) = Gal(Q(¢s)/Q). 


Example 6.8 Let n = 4, f = 1, and p = 1(mod 4). Then p = (zm), where z is a primary 
element of D- = Z|]. By Exercise 6.21 on page 292, 


G4 (p) = pr? =a = 782, 
where 


(op) = Gal(F'/Q) = Gal(Q(#)/Q). 


The following application of the Stickelberger Relation appears as Theorem 145 in Hilbert’s 
Zahlbericht, and was known to Kummer. The following is also a motivator for another result 
of Stickelberger, which we will establish at the conclusion of this section. 


Theorem 6.13 — Stickelberger and Class Groups of Quadratic Fields 
Let F = Q(V—2£) where ¢ = 3(mod 4) is prime and ¢ > 3. Then 


ola, 
where 
N= S- n 
is the sum over all natural numbers n < £ such that (=£) = —1, and 


R= Or 


is the sum over all natural numbers r < £ such that (=4) = 1, where (+) is the Kronecker 
symbol. 


Proof. Let K = Q(¢¢). Since inert primes are always principal and since the ramified prime 
q in F is principal since q = (/—@), then it suffices to look at primes p = pp’ where p is a 
prime 9 -ideal with p 4 p’. Thus, by Theorem 1.17, it suffices to prove that (N — R)/é 
annihilates®" the class (p), where 


(N-—R)/LEZ 


6-17This means that the exponent sends the class group to the trivial group. 
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by Exercise 6.31 on page 310. Let Z denote the decomposition subfield of p in K/Q, and 
let P be a prime Ox-prime over p. From the proof of the Stickelberger Relation, we know 
that 6(P) is a power of G(P) and so all conjugates of 6(P) are in Z. Therefore, the ideal 
generated by G6(P) is in Z, but this does not necessarily mean that 6(P) € Z. We must 
prove this. Set 


x = x, and =p! where f = fx/o(p)- 


If we let 6, be an element of Gal(Q(Gpc)/Q) such that dp|K = op, then 


G(P) = G(x) = s- x(a)? ee)" a Ss (a7?) Cae ie (6.48) 


xeF, xeF, 


Since 


given that op € Dp(A/Q), and 


Tr, /F, (©) = Tr, /r, (27"), °*8 


then (6.48) becomes 
6 Tey /Fp (wa(p)) 
G(P)% = ST x(a”, (6.49) 


reFg 


since £°? ranges over F, as x does, and a(p) is defined by 


CoP = Cap) 
for some a(p) € (Z/pZ)*. In turn, (6.49) is equal to 
ae op petg/Fp(®) a 
S> x7 (a(p))x(@)°? =x (a(p))G(x). 
wey 


We have shown that 


Hence, 6(P) € Z. 


For convenience sake, we may now let P denote both the prime 9 ,-ideal above p and the 
prime 9 z-ideal above p, since there is no splitting between Z and K. Diagram 6.2 below 
illustrates the scenario in the balance of the proof. By the Stickelberger Relation, 


(G(P)*) = (G(P)) = PEt tae 
Thus, by taking norms, we get 


(8) = (Nzyr (G(P))) = (Nar (py) =e te 


6.18Recall from Exercise 2.16 on page 64 and Definition 5.1 on page 182 that Op may be regarded 
as an element of Gal(F,/F,) via the natural map Gal(K/Q)++ Gal(F,/F,) since Fy =Ox/P and 
Fp = Oz/% = Z/pZ, where P= PN Z. 
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paz pois ter" = por ryln” _ pep’ 2. Tae 


since p’ = p’ for each n such that (=!) = —1.°19 Now let y = W/8. Then we have the 
ideal equation in M = F(¢,) given by, 

(7) = (Nay (GP) = pM”, 
where L = Z(¢,). 


Diagram 6.2 
Q(Cpe) 
a Xs 
L= Z(G) K = Q(¢e) 
a NS / 
M = F(¢) Z = Z,(K/Q) 
NS va 
F=Q(v-8) 


It remains to show that y € F. Let R = F(\/8). Since M/F is totally ramified at p and 
RC M, then it suffices to show that R/F is unramified, since then R = F. Given that we 
chose p to be unramified in K, then K(/8)/K is totally ramified at p by Theorem 5.1 on 
page 184. However, since (3) is the ¢“” power of an ideal in K, then K(/8)/K can only 
ramify at prime 0j-ideals over € by Theorem 5.19 on page 235. This forces K(</B) = K 
so R = F. Observe that p®-)/£ ~ 1 implies that 


pN-R)/e Ap 


so the result is secured.®-?° 


Example 6.9 Let @ = 23 and F = Q(—23). Then 
N=5+7+104+114+14+15+17419+ 20+ 214 22 = 161, 


and 
R=14+24+34+44+64+8+4+9+4+124+1384 164+ 18 = 92, 
? N-R 
ea Be 
L 
In fact, he = 3. 


Remark 6.9 Dirichlet actually proved that, for 0 = 3(mod 4) a prime, ¢ 4 3, 
1 


the proof of which involves analytic number theory. Furthermore, we will see as a special 
case of Exercise 6.31 on page 310 that (N — R)/é is indeed an integer, but the proof that 


(N-— R)/€EN 


also involves analytic number theory. Moreover, there is a link between the class numbers 
of Q(V2) and Q(V—8) and continued fraction expansions of V?@ see [30] and [73], as well as 
[49, pp. 158-162] for related results. 


6.19}¢ is instructive to compare this with Applications 5.1—-5.3 on pages 229-231. 
8.20 Jacobi discovered that 4p(\N-R)/é = x? + fy”, for some x,y € Z. On the basis of this result, he conjec- 
tured that hr = (N — R)/é. 
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Example 6.10 One can use (6.50) on page 307 to find class number one complex quadratic 
fields by setting 2 = N — R, from which one calculates that for 


Ar = —£€{— 7,—-11, -19, —43, —67, —163} 


we have hp = 1. 


Remark 6.10 Theorem 6.13 on page 305 says that the class group of F = Q(/—), for 
é = 3(mod 4) a prime, is annihilated by 


e-1 
1 x 1 1 
pata) es (x»- E>] th 
«w=1 n r 
where the residue symbol is the Lengendre symbol. In other words, 
—1Ly l-l g(z i 
a aes hy 


There is a more general result about annihilation of class groups as follows. 


Theorem 6.14 — Stickelberger on Annihilation of Class Groups 


Let F = Q(¢,) where n € N, and let @ be the Stickelberger element. If a € Z[G], where 
G = Gal(F'/Q), such that af € Z[G], then a annihilates Co,. 


Proof. Let p be a rational prime, with p a prime O;-ideal above p { n. Then by the 
Stickelberger Relation, 


(G(p)") = (G(p)) = p””. 
Thus, if a € Z[G] such that a@ € Z(G], then 
(p?*)" = ("*)" = (GP)")* = (G@))". 
Let y = G(p)"*, and set L = F( 4/7), so L is a Kummer extension. Since 
abe Z|G] ~Or 


by Exercise 5.48 on page 253, then (G(p)*) = p®® is an Op-ideal, so (7) is the n“” power of 
an 0 p-ideal. We now show that G(p)* € F. It follows from Theorem 5.19 on page 235 (by 
looking at successive prime degree, g, extensions of F' in L for q | n), that L/F is unramified 
for any prime D-ideal above rational primes not dividing n. Since 


FECL CE QCnp), 


by part (a) of Proposition 6.7 on page 296, then L/F must be ramified at primes above p 
by Theorem 5.4 on page 189. However, by Corollary 5.13 on page 218, the only ramified 
primes in L/F are those above p{n, a contradiction unless L = F’. Hence, 


G(p)* € F. 


We have shown that p?% is a principal ideal in O-. By Theorem 1.17 on page 28, we 
have shown that every ideal prime to (n) is principal in Or. However, by Exercise 1.38 on 
page 33, every class of Cp, contains an ideal prime to (n), so the proof is complete. 


In Theorem 6.13 on page 305, we were not dealing with a cyclotomic extension. However, 
there is a consequence of Theorem 6.14 that does deal with the more general case. In the 
following, we use the Kronecker-Weber Theorem presented on page 244. In particular, the 
reader is reminded that the conductor of an abelian extension K of Q is the smallest natural 
number n such that K C Q(¢,). 
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Corollary 6.4 Suppose that K/Q is an abelian extension with conductor n, and G = 
Gal(K/Q). If 6 is the Stickelberger Element and a € Z[G] such that a0 € Z[G], then C2’. = 
1, where the o; in Theorem 6.12 on page 302 denote both the elements of Gal(Q(¢,)/Q) = 
Gal(F'/Q) and Gal(K/Q), namely we understand o; to mean o;|K when considering Gal(/Q). 


Proof. Let p{n be a rational prime, and let P be a prime OD -ideal over p with PNOK = p. 
Let a € Z[G] such that aé € Z[G]. Extend the elements of G so that a@ may be regarded 
as an clement of Z[Gal(Q(¢,)/Q)]. From Theorem 6.14, we have (pO x)°" = (G(P)*). Let 
a € Gal(F'/K), which permute the prime 9 ,-ideals over p by Corollary 5.1 on page 190. Let 
T € Gal(F (Cp) /Q(p)) such that T| 7 = 0, so CF = Cp (recalling that p{ mn), and p’ = p. Then 
G(P)" = G(P7), by Definition 6.9 on page 295. Hence, G(P)°? = G(P)*, so G(P)* € K. 
We have shown that p®’ = (G(P)*), so as in the proof of Theorem 6.14, Co, is annihilated 
by ad. 
The following illustrates the power of Theorem 6.14 by completely generalizing Theorem 
6.13 with ease. The proof of the following is due to Lemmermeyer [838]. 


Corollary 6.5 LetA - < 0 be the discriminant of a complex quadratic field withA p ¢ 
{—3, —4,—8}. Then 


(N-R)/|Ar| 
Co ea 


where N is the sum of all natural numbers n < |Apr| such that (AF) = —1, and R is the 
sum over all natural numbers r < |Apr| such that (AE) = 1, where (=) is the Kronecker 
symbol. 


Proof. In this case, the Stickelberger Element is 


_R+oN 


6 
|Ar| 


where 


(oc) = Gal(F'/Q) =G. 
Also, by Exercise 6.31 on the following page, 6 € Z[G]. Thus, by Theorem 6.14, 
6 
C5, =1, 


but cs’ =1,s0 
ol oa 5p Coy et 
F F 2 


as required. 


Example 6.11 LetA - = —52. Then 
N=3+5+4 214 234 27+ 334 35+ 374 41+ 434 45 + 51 = 364, 


and 


R=14+74+94114+ 154174 19+ 25+4 29+ 31+ 474 49 = 260 


1@) 
oY PMSel GB a1 


In fact, he = 2 for F = Q(V—18). 


Theorem 6.14 was proved for kK = Q(G,), where p is a prime, by Kummer in 1847. It was 
proved in general by Stickelberger in 1879. 


310 6. Reciprocity Laws 


Remark 6.11 For the reader interested in exploring the consequences of this theory at a 
higher level, we give the following data. Analogues of Theorem 6.14 for totally real fields 
have been found by B. Oriat [57] and A. Wiles [70]. There is also the important work 
of Thaine [68], where cyclotomic units are used to define an analogue of the Stickelberger 
element for real abelian fields. This allowed him to prove a result on the annihilation 
of class groups of real abelian number fields. Subsequently Kolyvagin invented tools for 
constructing relations in ideal class groups, extending Thaine’s methods. These methods 
have had deep and far-reaching consequences. Among them is the use of these tools to give 
an elementary proof of the Main Conjecture of Iwasawa theory—see [69] for details on the 
results surrounding Kolyvagin’s work. 


Exercises 


6.28. Let n € N, n > 1, and ¢, a primitive n*” root of unity in a field F. Prove that 
n-1 >j : 
~j=0 6 = 0 in F. 
6.29. Let Fz where g = p" and p is prime. A Gauss or Jacobi sum over F, is called pure 
if, when raised to a natural number exponent, it becomes real. Prove that quadratic 


Gauss sums are pure, but Gauss sums belonging to characters of order k > 2 are never 
pure when gq = p. 


This result was first proved by Stickelberger in 1890. Pure Gauss sums are a useful 
tool in many areas including the determination of when —1 is the power of a given 
prime modulo a natural number. For instance, see [3]. 


(Hint: Use Exercise 5.84 on page 232, Exercise 5.52 on page 260 part (a) of Propo- 
sition 6.7 on page 296, and part (a) of Proposition 6.7 on page 296.) 


6.30. Let £>3 be a prime with ¢ = 3(mod 4), F = Q(V—2) and p a prime 9 p-ideal above 
the rational prime p = 1(mod £). Suppose further that p = pp’ in Or, and K = Q(¢,), 
with P is a prime Ox-ideal over p. Prove that pOx = |], P’", where the product 
runs over all natural numbers r < £ that are squares modulo @, and o,(¢e) = ¢). 


(Hint: Use Application 5.2 on page 230.) 


6.31. With N, R and @ > 3 given in Corollary 6.5, prove that 


N=R=0 (mod |Ag)). 
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6.4 The Eisenstein Reciprocity Law 


The end crowns the work. 
Early sixteenth century proverb 


The object of this section is to establish the Eisenstein Reciprocity Law—see Theorem 6.15 
on page 313 and Biography 3.10 on page 137. First, we need to extend the definition of 
6(p), given in (6.38) on page 295, from prime ideals to arbitrary ideals as follows. 


Definition 6.12 — Power Residue Gauss Sums Extended 


Let F = Q(¢,) and I be an O p-ideal with I = Ls p,;, a product of not necessarily distinct 
prime 9 -ideals. Then 


8(1) = ]] Sp). 


A sequence of lemmas is required to prepare for the proof of the Eisenstein Reciprocity law. 


Lemma 6.6 — More Properties of Power Residue Gauss Sums 


Suppose that F’' = Q(¢,), I, J are Op-ideals, and a € Op with gcd(IJ,n) = 1 = ged(a, n). 
Also let tT = >, toy ', where the sum runs over all natural numbers t < n with ged(t,n) = 1. 
Then each of the following holds. 


(a) 6()G6(J) = 6(LJ). 
(b) |6(D)/? = (NG))”. 
(c) (6()) ="). 
Proof. Part (a) is immediate from Definition 6.12. By part (a), it suffices to prove parts 
(b)-(c) for J = p, a prime 0 r-ideal. By part (b) of Proposition 6.7 on page 296, 
|S(p)|? =p" = (N(p))", 


which yields part (b). Part (c) is Theorem 6.12 on page 302. 


We now need to explore the action of 7, defined in Lemma 6.6, on power residue Gauss 
sums over principal ideals. 


Lemma 6.7 — Galois Action on Power Residue Gauss Sums 


Let F = Q(¢n), n € N, J an Op-ideal such that gcd(n,I) = 1, o € Gal(F/Q), and 
Tt = 0, to>!, where the sum runs over all natural numbers t < n with gcd(t,n) = 1. Then 


(a) G(D)? = G(I?). 
(b) If we Op, then |a7|? = |Np(a)|". 


(c) If @€ Op such that ged(a,n) = 1, then 6((a)) = +¢2a7 for some j € Z. 
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Proof. By part (a) of Lemma 6.6 on the preceding page we may let J = p be a prime 
OD p-ideal. If a € Dp, then 


q-1 -1 
a Ge Fp (x n res Bp (%(@)) 
G)= (SZ) Ger = TL xPencanyg er 


where the sum runs over all coset representatives (a) € Or/p = Fy (with w being given 
as in Definition 6.8 on page 295). Let 


& € Dp(Q(Gmn)/Q), 


where G|~ =o. Then by Proposition 6.6 on page 295, 


nm oor p (Pla? )) oOo 
= Mp ae =G(p’). 


Thus, by raising each side to the n‘” power we get 6(p)” = G(p7), from which part (a) 
follows. 


For part (b), let o_1 be complex conjugation, namely o_1 : ¢, +> G71. Therefore, 


la7/? =a7a7-}! = Qt tea), (6.51) 
Since 
O01T=0-] Stee - Dee - SI (n—t)o,, 
t t t 
then 
(i+o24)7 = os to) + Soin —t)o',= ny = ny Ge 
t t t t 

However, 


= Ile JP Ge. 
Therefore, from (6.51), 
|Nr(a)|" = lal" 22" = Jal™+e-) = Jar’, 


which secures (b). 
Since 
(6((a))) = (@)" = (@"), 
by part (c) of Lemma 6.6, then as ideal generators, 6(p) and a” differ by a unit. In other 


words, 
S((a)) = ua’, 


for some u € Lp,. Since 


by part (b) of Lemma 6.6, and 


by part (b) of this proposition, then 


N((a)) = |Nr(@)|; 
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by Corollary 2.8 on page 85. Hence, 
I6((a))| 


na 


Similarly, it may be shown that |u?| = 1 for all o € Gal(F/Q). Therefore, by Corollary 3.10 
on page 128, a € Rp. In other words, u = +¢/ for some j € Z. 


=1. 


We need one more notion in order to state the Eisenstein Reciprocity Law. 


Definition 6.13 — Primary Cyclotomic Integers 
Let r > 2 be a prime, F = Q(¢,), and a € Op. Then a is called primary if gcd(a,r) = 1 
and 

a=z (mod (1-¢,)?) 


for some z € Z. 


Remark 6.12 We do not need the notion of semi-primary here, which is what Hilbert 
called these a. He needed a stronger notion of primary in order to prove Kummer’s Reci- 
procity Law (see [38]). Hilbert called an element a primary if it is semi-primary, or what we 
have defined here as primary, together with the additional property that a@ is congruent 
to a rational integer modulo (1 — ¢,)"~1. 


Theorem 6.15 Eisenstein’s Reciprocity Law 


Let r be an odd prime, F' = Q(¢,), a € Z and a € Or be a primary element such that 
gcd(r,a) = 1 = gced(a,a). Then 


Proof. By Proposition 6.5 on page 295, it suffices to prove this result for a = p; a prime. Let 


pi bea prime 9 p-ideal above p; with N(pi) = pl = q. Then by hypothesis, gcd(p1,r) = 1. 


Claim 6.23 (SD) = (Med) | 
Pi - a - 


By part (a) of Lemma 6.6, it suffices to prove the claim for (a) = p, a prime D-ideal with 
N(p) = pf =q. Thus, in Dp we have the following congruences: 


q-1 , ai ss 
G(p)" = e x5” (0)) Ge. = S~ xs” (x)Cpta/te ) (mod 71), 


since gq; = 1(mod r) by Corollary 5.13 on page 218. Therefore, the above is in turn congru- 
ent to 


q-1 q-1 
r = Trg p (2) _ (4 r Trg p (&) aif 
ExPartag = (2) VP = (2) ete) (aod p. 
rT x=0 r 
Also, in Or, we have the following congruence: 


G(p) 271 = G(p)@-Y/t = (<o 


Pi 


(202) =() (222), mim 


Since r ¢ pi, then we must in fact have equality in the last congruence, which establishes 
Claim 6.23. 


iN (mod pi). 


Hence, 
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Claim 6.24 Let 6(a) = +¢/a7, where 7 is given in Lemma 6.7 on page 311. Then 


( : iN ( ) ( a) 
Pi (pi) a 
By Claim 6.23 and pal (c) of Lemma 6.7, 


(i) (Stan) = (22) (=). 
ee ae 


by Proposition 6.6 on page 295, then 


(=) -(S-) -0Gs),- Gin), 


where the last equality comes from Theorem 5.5 on page 190. From (6.52), Claim 6.24 now 
follows. 


Claim 6.25 (wes). = (“e) 


By Claim 6.24 we need only show that (24) = 1. However, a is primary, so by Exer- 
cise 6.33 on page 317, +¢7 = +1. Thus, 


Coes 


since r is odd and N(p1) —1 = pi —1is even. This completes the proof of Claim 6.25. 


and since 


4+1)(N(P1)-V)/r =1, 


— 


By Proposition 6.5 on page 295, Claim 6.25 says that 


Gea ee. 


However, since pl = 1(mod r), then fi | (r — 1), so ged(fi,r) =1. Therefore, 


Gee 


which completes the proof of the Eisenstein Reciprocity Law. 


One of the more pleasing applications of the Eisenstein Reciprocity Law is the following 
result proved in 1912. This was an important development in the long search for a proof of 
FLT. From this result will follow another important such result proved by Wieferich. 


Theorem 6.16 — Furtwangler’s Theorem 


Let x,y,z € Z be pairwise relatively prime, and let p > 2 be a prime such that 
xP +yP +2? =0. (6.53) 
If p{ yz and q is a prime divisor of y, then 


q?-'=1 (mod p’). 
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Proof. We remind the reader that, as we saw in the proof of Theorem 4.4 on page 152, 
(x + GSy) is a p*” power in Op for any j > 0, where F = Q(¢,). Let 

u=(aty)P7y, a = (a ty)? (x + Gpy), and A= 1 — Gp. 
Then since 1 +¢,y = «+y—yA, and by Exercise 4.19 on page 162, (c+y)?~' = 1(mod A?), 
Gp *a = (1-A) “a = (1-A) “((wt+-y)P*—yA(aty)? 7) = (14udA)(1-uA) = 1 (mod ”). 


This shows that ¢, “a is primary. Thus, by the Eisenstein Reciprocity Law, 


oe nha) Ga 


Since (¢>“a@) = (a) is a p™” power, then 


(5), @, ws 


However, q | y and a= (x +y)?~'(mod gq), so 


Glee aa. = (ea 


since (x + y) is a p’” power. Thus, by (6.54), 


(@) =1 (6.55) 


Claim 6.26 If g = grjo(q) and f = fr/g(q), then 


“) — ¢9(a' -1)/p 
({ 7 ‘p , 


Let gOr = []_1 aj. Then 
(2) = Il (=) = II c(a" D/P _ child Dip _ ca(a! D/P 
oy: = a = P ~~ SP — Sp ’ 
I) p j=l V7 p j=l 
by Definition 6.7 on page 294. Therefore, by (6.55), 


gf — 


ug : =0 (mod p). (6.56) 


Since g | (p—1) by Theorem 5.4, then p{ g and since u = (x+y)?~*y, then p{ u. Therefore, 
by (6.56), (¢/ — 1)/p = 0(mod p). In other words, 


qi =1 (mod p’). 


Since f | (p — 1), then we have Furtwangler’s result. 
A simple consequence of Theorem 6.16 is the following, proved in 1909. 
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Corollary 6.6 — Wieferich’s Theorem 
If 2? + y? + z? = 0 has a solution for nonzero x,y,z € Z with p{ xyz, then 

2?-'=1 (mod p’). (6.57) 


Proof. Clearly one of x,y, z is even, so we may assume without loss of generality that 2 | y. 
By Furtwangler’s Theorem with g = 2, the result follows. 


Remark 6.13 Primes p satisfying (6.57) are called Wieferich primes. The first two such 
primes are p = 1093 and p = 3511. More generally, if we replace 2 with any b € N with 
b > 2 and require that they satisfy (6.57), then these are also treated as Wieferich primes. 
It is unknown if there are infinitely many such primes for a given base b. It is not even 
known if there are infinitely many such that (6.57) fails to hold for a given base b. Examples 
of bases b > 2 for which (6.57) holds are (b, p) = (5, 53471161), (7, 491531), (11, 71). 


There have been many generalizations of the Eisenstein Reciprocity Law given by Artin, 
Hasse, Hilbert, and Takagi (see [38] for an overview). Some are beyond the scope of the 
theory presented in this book. For instance, for a statement of a general reciprocity law 
using local class field theory see [15, pp. 167-168]. For the Artin Reciprocity Law given 
in terms of idéles (introduced by Chevalley in order to give an approach different from the 
classical one that allows global class field theory to be deduced from the local one), see 
Tate’s article in [10, Chapter VII, pp. 162-203]. One may also consult Hasse’s article in 
[10, Chapter XI, pp. 266-279]. In fact, we conclude this section with the statement of a 
general reciprocity law that is within the purview of the theory provided herein. 


Theorem 6.17 — The Artin—Hasse Reciprocity Law 


Let F = Q(¢,) where r > 2 is prime and a,f € Op such that gcd(a,8) = 1, a=1(mod r), 
and 6 = 1(mod A), where \ = 1 — ¢,. Then 


a B one Tro(*= 2) 
§) (3), =¢ : 
Proof. See [38]. 


To illustrate the power of Theorem 6.17, we show how to easily achieve the Eisenstein 
Reciprocity Law from it, as a closing feature of this last section of the main text. The proof 
in the following was communicated to this author by Franz Lemmermeyer in the writing of 
the first edition. 


Example 6.12 Since 


then it suffices to show that 


(S ) (3) 

r—1 = r—-1 _ 

a r a r 
1 


To this end, let b = a’~! = 1(mod r) and 8 = a’~' = 1(mod A). Thus, Theorem 6.17 


applies since 
b-1 -1 b-1 -—1 
tna (22-223) 82 yg (222) 20 (amt) 


r 
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given that r | (8 —1)/A when a is primary. Hence 


The Theorem 6.17 is one of the simpler formulations found by Artin and Hasse in a search, 
between 1923 and 1926, for what Hilbert called “The most general reciprocity law.” The 
quest continues into the realm of non-abelian class field theory, spearheaded by the work of 
Langlands and Shimura, and carried on by numerous others. 


Exercises 


6.32. Let r > 2 be a prime, F' = Q(¢,), and I an Op-ideal such that ged(r, 1) = 1. Prove 
that 


6(I) =+1 (mod r). 


6.33. Let r > 2 be prime, F = Q(¢,), and a € Or a primary element. Prove that 6(a) = 
+a‘™, where 7 is given in Lemma 6.7 on page 311. 


6.34. Suppose that a € Z, and £@ is a rational prime such that ¢{ a. Prove that 


x’ =a (mod p) 


is solvable for all but finitely many primes p if and only if 
a=0v' 


for some 0 € Z. 


In a course in elementary number theory, one quickly learns the fact that an integer a, 
which is a square modulo all primes 0, must be the square of a rational integer. This 
is usually given as an application of the Jacobi symbol. This exercise is intended to 
substantially generalize that fact as an application of the Eisenstein Reciprocity Law. 


Biography 6.2 Phillipp Furtwangler (1869-1940) was born on April 21, 1869 
near Hildesheim, Germany. By the age of fourteen, he had lost both of his 
parents. He went to school in Hildesheim, then went to Gottingen in 1889. 
At this time Hilbert had not yet arrived at G6ttingen, but Fricke and Klein 
were there. Furtwangler completed his dissertation on ternary cubic forms in 
1896. He then held numerous positions. The first was as an assistant at the 
Geodesic Institute in Potsdam from 1897 to 1903. In 1903, he married Ella 
Buchwald, but she died shortly after the birth of their daughter. Then he 
was at the Agricultural Academy in Bonn from 1903 to 1907, after which he 
taught at the Technical University in Aachen, then returned to Bonn. His 
activities during those years included a proof of the reciprocity law for prime 
powers, and establishment of the existence of Hilbert class fields. In 1912, he 
succeeded Mertens at the University of Vienna. While at Vienna, his research 
activities included the problem of capitulation in Hilbert class fields, and a proof 
of Hilbert’s principal ideal theorem in 1930. He also worked in Diophantine 
approximation, the geometry of numbers, and FLT. In 1929, he married Emilie 
Schon at a time when he was already quite ill, and had to retire in 1938. He died 
on May 19, 1940 in Wien. There are streets in Germany named Furtwangler 
after Phillipp’s distant relative, Wilhelm Furtwangler, the famous conductor 
and composer. 
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Appendix A 


Abstract Algebra 


It’s hard to beat a person who never gives up. 
Babe Ruth (George Herman Roth) (1895-1948) 
American baseball player 


The purpose of this appendix is to give a review of the background material required for 
understanding the concepts in the text as a finger-tip reference to the basic concepts in 
abstract algebra. We do this via a discussion of the fundamental concepts, without proofs, 
so the reader may be reminded of the salient background information without having to go 
to another source. However, if proofs are required, the reader may consult such standard 
texts as [29]. 


First, we will consider the following set of axioms, and discuss certain sets 8, together with 
binary operations of addition, denoted by +, and multiplication, denoted by juxtaposition 
or by - the multiplication sign. We will determine which sets satisfy certain of these axioms, 
and thereby introduce the various concepts in a basic course in abstract algebra 


+ Basic Axioms 


A.l. For alla,8 €8,a+8€8. (Additive closure) 

A.2. For alla,8 €8,a8€8. (Multiplicative closure) 

A.3. For alla,G €8,a+6=8+a. (Additive commutativity) 

A.A. For all a, B,y € 8, (a+ 6)+y=a+ (64+). (Additive associativity) 


A.5. There is a unique z € § with z+a=a+z=a. (Additive identity) 


(When no confusion can arise, we use the symbol 0 here for the additive identity z, 
since it mimics the ordinary zero of the integers.) 


A.6. To each a € 8, there is an a* € § such that a+a* =a*+a=z. (Additive inverse) 
A.7. For alla,8 €8, a8 = Ba. (Multiplicative commutativity) 
A.8. For all a, B,y € 8, (a8)y = a(By). (Multiplicative associativity) 


A.9. There exists a unique 1g € 8 such that for each a € 8, lsa = alg = a. (Multiplicative 
identity) 


(Here, as with the additive identity above, we can use the symbol 1 in place of the 
multiplicative identity 1s, when no confusion will arise from so doing, since 1s mimics 
the function of this multiplicative identity of the integers.) 


A.10. For all a, B,y € 8,a(6+y7) =aB+ay. (Distributivity) 


A.11. For all a,8 € 8, if a8 = z, then a = z or B =z. (No zero divisors) 
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A.12. For any a € 8, with a ¥ z there exists an element denoted by a~! such that aa~! = 


1s =a'ta. (Multiplicative inverse) 


+ Groups 


An additive group is a set satisfying A.1 and A.4—A.6. Similarly, a multiplicative group is 
a set satisfying axioms A.2, A.8-A.9, and A.12. 


In the text—see the proof of Theorem 2.10 on page 77, for instance—we will have need of 
a special multiplicative group as follows. 


Definition A.1 — The Symmetric Group 


The symmetric group on n letters, denoted by Sy, is the set of all bijections of {1,2,...,n}. 
Multiplication is given by composition of functions, which is associative. The identity map 
is the identity of S,,, and there are unique inverses since bijections are one-to-one and onto 
(see the section on Mappings—Morphisms starting on page 326). The elements of S,, are 
called permutations, and the cardinality of S, is n! = n(n—1)(n—2)---2-1. A transposition 
is an element o € S, such that o interchanges two elements of {1,2,...,n}, while leaving 
all of the others fixed. 


A basic fact concerning permutation groups is that all permutations are expressible as a 
product of transpositions. This leads to a finer classification of the elements of Sy. 


Definition A.2 — Even and Odd Permutations 


Ifo € S, and a is the product of an odd number of transpositions, then o is called an odd 
permutation. If o is the product of an even number of transpositions, then o is called an 
even permutation. The set of all even permutations forms a subgroup of S,,, denoted by 
An, called the alternating group on n symbols, and |A,| = n!/2. The sign of a permutation 
a, denoted by sgn(o), is 1 or —1 according as o is even or odd. The sgn is a well-defined 
map since it can be shown that a permutation cannot be both odd and even. 


Definition A.3 — Abelian Groups 


Any set which satisfies A.1, and A.3-A.6 is an additive abelian group, and if it satisfies A.2, 
A.7-A.9, and A.12, then it is a multiplicative abelian group. If G is a multiplicative abelian 
group, then G is cyclic whenever the group generated by some g € G, coincides with G. 
The element g is the generator of G, denoted by 


G = (9). 


If g” = 1g for some n € N, then the smallest such n is the order of the finite cyclic group 
G, denoted by n = |G|. If no such n exists, then G is said to be an infinite cyclic group. A 
group P is called an elementary abelian p-group for a prime p € Z if every element x € P 
satisfies z? = 1. If P is the maximum elementary abelian p-subgroup of a group G, and 
|P| =p", then r is called the p-rank of G. This means that 


P= OQ x-++x Cy, 
e—{!=-—_“— 
r factors 
where Cy, is a cyclic group of order p, and G does not contain a subgroup of this type with 
more than r factors.“"! 


A-1Note that this definition of rank is valid only for abelian groups. In general, one may define the rank 
as the number of factors of the maximal p-elementary abelian “factor” group. For instance, the quaternion 
group has 2-rank 2, and its maximal elementary abelian subgroup is Z/2Z. 
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The following basic result is useful in the main text—see the proof of Lemma 5.12 on 
page 244, for instance. 


Remark A.1 In the following, a direct product of groups is formed by the component-wise 
multiplication of elements. 


Theorem A.1 — Fundamental Theorem of Finite Abelian Groups 


If G is a finite abelian group, then any two decompositions of G into a direct product of 
cyclic groups of prime power order contain the same number of multiplicands of each order. 


+ Cosets of Groups 


Let G, H be arbitrary groups with H C G. Then H is a subgroup of G. Suppose that g € G 
is fixed, and set 

gH ={gh:he H}. 
Then gH is a left coset of H in G determined by g. A right coset is similarly defined. If G 
is abelian, then left and right cosets are equal, and we refer merely to a coset of H in G. 
Let 91,92 € G be fixed. Either 

nH nH =o 

or 


ciped = goH. 


Furthermore, the group G is partitioned into disjoint left cosets of H. The number of 
distinct left cosets of H in G is denoted |G : H], called the index of H in G. (In particular, 
|G| is the order of G.) Moreover, it is an easy task to verify the following fact. 


Proposition A.1 — Group Criterion 


If G is a group, then the nonempty set H C G is a subgroup of G if and only if hyhy' € H 
for all hi, he € HA. 


Given the above discussion, we may conclude that G is partitioned into a disjoint union of 
|G : H| subsets, each containing |H| elements. Thus, by counting the number of elements 
in G, we get the following. 


Theorem A.2 — Lagrange’s Theorem 


If G is a group and 4H is a subgroup of G, then 

IG|=|G: Hl -|Al. 
Corollary A.7 If G is a finite group and |G| = n, then |g| | |G|, and g” = 1 for all g € G. 
Theorem A.3 A finite abelian group of order n € N has subgroups of all orders dividing 


n. 


Given the above setup, we may now define another group. 
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Definition A.4 — The Quotient Group and Normal Subgroups 
Let G be an abelian group, and H a subgroup of it. The quotient group 


Ga G/H HIG, sont — {nag} 
of G by H is the group with multiplication defined by 
959k = 9j)9kH = 9; Hg.H = geH = Gg, 


for some ¢ = 1,2,...,n, having identity lq = H = Ig, and inverses G = Gat. The 
mapping = 

w:Gr G, given by pi gu yg 
is called the canonical map, or natural map. 


If G is not an abelian group, then in order to form the quotient group one needs the following 
concept. A subgroup H of G is called normal provided that gH = Hg for all g € G. In 
other words, the left and right cosets of H in G agree, or that H is always conjugated 
to itself, namely g-'Hg = H for all g € G. When H is normal in G, we may form the 
quotient group G as the set of all products of cosets. Since left and right cosets agree, 
then the product of any two cosets is again a coset of H in G, so G is a group with this 
multiplication. 


+ Rings and Fields 


Definition A.5 — Rings, and Fields 


(1) A ring is a set together with two binary operations called addition and multiplication, 
denoted by + and x, satisfying the following: 


(a) R is an abelian group under addition. 
(b) Multiplication is associative: (a x b) x c=a x (bx c) for all a,b,c ER. 


(c) The distributive law holds in R, namely for all a,b,c € R multiplication is 
distributive over addition, namely 


ax (b+c) =(ax b)+(axc) 


and 


(a+b) xc=(axc)+(bxc). 
(2) If (1) holds, and multiplication is commutative, then R is called a commutative ring. 
(3) If (1) holds, and there is an element 1 € R such that 
1xa=ax1l=aforalla€c R, 
then R is called a ring with identity. 


(4) A ring with identity 1 4 0 is called a division ring or skew field if every nonzero element 
of R has a multiplicative inverse. 


(5) A commutative division ring is called a field. 
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Definition A.6 — Subrings 


A subring of a ring R is a subgroup of R that is closed under multiplication. 


Remark A.2 Definition A.6 says that, in practice, to show that a subset of a ring Risa 
subring it suffices to show that it is nonempty and closed under subtraction and multiplica- 
tion. 


+ Modules 


Suppose that M is an additive abelian group, and that R is a ring, which satisfy each of 
the following axioms: 


A.13. For each re R, me M,rme M. 

A.14. For each r € R and m,n € M, r(m+n) = (rm) + (rn). 
A.15. For each r,s € Rand me M, (r+s)m= (rm) + (sm). 
A.16. For each r,s € Rand m € M, r(sm) = (rs)m. 

A.17. If R has identity 1g, then for each m € M, lpm =m. 


Then M is a left module over R. If R is a commutative ring with identity, then M is botha 
right and a left R-module called a two-sided, unitary module or for our purposes, simply an 
R-module. For example, being a Z-module is equivalent to being an additive abelian group. 
If R is a division ring, then M is called a vector space, and multiplication from R is called 
scalar multiplication, with the elements of M called vectors.” 


A submodule of an R-module M is a subset N of M such that 
A.18. N is a subgroup of the additive group of M, and 
A.19. For allr€ R,andnée N,rne N. 


It follows that a subset N of M is an R-submodule of M if and only if 
A.20. OE N, 

A.21. For all m,n € N, m—n€ N, and 

A.22. For allr€ R,andnée N,rne N. 


For instance, if G is an additive abelian group, then for any n € Z, 


ng=t(g+g+---+49). 
Rp 
|n| copies 
Therefore, abelian groups are Z-modules and the submodules are just the subgroups thereof. 
Let m € M be fixed, and let N be a submodule of M. Define 
m+N={m+n:neEN}, 


the coset of Nin M determined by m. 


A.2There is a more general definition of vector space (and of a module), which we do not need in this text. 
For the more general setup, and details pertaining to it, the reader may consult [29, p. 169 ff]. 
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A.23. For any m,,m2 € N,m,+N =m2+N if and only if mm, + N = mm2+N for any 
meéeM. 


A.24. Define 
M/N={m+N:meM}. 


Then M/N is an R-module, called the quotient module of M by N. If M/N is finite, 
we denote its order by 
|\M/N|=|M : NI, 


the index of N in M. 
IfS ={M; : 7 =1,2,...,n} is a set of R-modules, then let M be the set of n-tuples 
(m1,m2,...,Mn) with m; € M; for 7 = 1,2,...n, 


with the zero element of M being the n-tuple, (0,0,...,0). Define addition in M by 


(m1, M2,...,Mn) + (m4, mg,...,m,) = (m1 + m4, m2 + mg,...,™Mn +m,,), 
for all m,;, mi € M; with j = 1,2,...,n, and multiplication from R on an n-tuple from M 
by 
r(m1,M,---,Mn) = (rm, rMg,...,7My) for all r € R. 


This defines an R-module structure on M called the direct sum of the modules M;, 7 = 
1,2,...,n, denoted by 
By M; =M6::-@M,. (A.1) 


Definition A.7 — Bases, Dependence, and Finite Generation 


If S is a subset of an R-module M, then the intersection of all submodules of M containing 
§ is called the submodule generated by S, or spanned by 8. If there is a finite set 5, and $ 
generates M/, then M is said to be finitely generated. If S = @, then S generates the zero 
module. If 8 = {m}, a singleton set, then the submodule generated by 8 is said to be the 
cyclic submodule generated by m. 


A subset 8 of an R-module M is said to be linearly independent provided that for distinct 
$1, 82,---,5n € 8, andr; € R for 7 = 1,2,...,n, 


n 
Sos; = 0 implies that r; = 0 for 7 = 1,2,...,n. 
j=l 


If S is not linearly independent, then it is called linearly dependent. A linearly independent 
subset of an R-module that spans M is called a basis for M. 


An important concept that we will need throughout the text is the following notion—see 
Theorem 2.9 on page 75, for instance. 


Definition A.8 — Free Modules and Free Abelian Groups 


If R is a commutative ring with identity, and M is an R-module, then M is called a free 
R-module if M has a nonempty basis. 
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Remark A.3 The situation of most interest to us in the text is that of a free Z-module, 
which is just a Z-module with a basis, and this is the same thing as a free abelian group. It 
can be shown that any two such bases for a free abelian group G have the same cardinality. 
Therefore, this cardinality is an invariant of G, called the rank of G. If the number of 
elements in a basis is finite then the free abelian group is said to be of finite rank. Further- 
more, it can be shown that all subgroups of a free abelian group G are also free abelian 
with rank at most that of G. 


Vector spaces (over division rings, remember) are special. 


Theorem A.4 — Vector Spaces and Dimension 

If V is a vector space, and § is a subset that spans V, then S$ contains a basis of V. 
Furthermore, any two bases of V have the same cardinality. This is called the invariant 
dimension property. 

The cardinality of a basis for a vector space V over a division ring D is called the dimension 
of V over D, denoted by |V : D|. A submodule of a vector space is called a subspace. 


Application A.1 — Field Extensions 

If D CV, where D and V are fields in Theorem A.4, then V is called an extension field of 
D and |V : D| is called the degree of the field extension. It follows that if Fy C Fh C F3 CC 
with F) fields for 7 = 1, 2,3, then 


|F3: Fi] =|F3: Fo|-|Fo: Fil. (A.2) 


If S is a subset of a field F’, then we call the subfield generated by & the intersection of all fields 
containing F' and containing S. If FE is an extension field of F and 8 C E, then the subfield 
generated by F and 8 is defined as the subfield generated by F US. If 8 = {a1, a2,...,an} 
for some n € N, then the field generated by S and F' is denoted by 


E = F(ay,02,..., Qn) 


called a finitely generated extension of F. In the case where n = 1, EF is called a simple 
extension of F. 

When K, F C C are fields, then the compositum of K and F, also called the composite, is 
the smallest subfield of C containing both K and F’. This consists of all finite sums }> a; 3; 
where a; € K and §; € F. In particular, for the simple extensions defined above, we have 
that whenever a € C is algebraic over F’, then 


Flo] = F(a) = Fla]/(f(2)), (A.3) 


where the generator of the ideal, given by f(a), is an irreducible monic polynomial uniquely 
characterized by the conditions: (1) f(a) = 0, and (2) if g(a) € Fla] with g(a) = 0, then 
f(z) | g(a). See Example 1.22 on page 19, for instance, as an application of this result. 


In the above, we defined free R-modules. We may now present another characterization of 
those free R-modules of finite rank. An R-module M of rank n € N is free provided that 
it is isomorphic to a direct sum of n copies of the R-module R. In particular, every free 
Z-module M of rank n is of the form 


M~Z@.---@Z, (A.4) 
—_-__— 


n copies 
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and this is called a free abelian group of rank n. Thus, every subgroup of a free abelian 
group of rank n is a free abelian group of rank at most n. 


+ Mappings — Morphisms 
Let S and T be sets. Then a function (or mapping) 
f:SH6T 
is called an injection (or one-to-one) provided that 
a = 6 whenever f(a) = f(b) for any a,b € S. 
It is called surjective (or onto) provided that f(S) = T, namely 
T = {f(s): 8 € S}, 


or in other words, for each t € T, there exists an s € S such that f(s) =t. A function f is 
called bijective (or a bijection) if it is both an injection and a surjection. 


Suppose that G and H are two groups where - denotes the operation in G, and ® denotes 
the operation in H. If 
f[:Gvond 


is a function such that 

f(g + 92) = f(g1) ® f(92), 
then f is called a homomorphism of groups (or group homomorphism). When there is no 
danger of confusion, we express ® and - simply by juxtaposition, and write 


f(g192) = fm) F(g2), 


for convenience. We will maintain this convention in the sequel, namely that we will not 
distinguish between the operations in the objects under consideration. 


Definition A.9 — Auto, Endo, Iso, and Mono-Morphisms 


If f : GH F is injective as a map of sets, then f is called a monomorphism of groups (or 
group monomorphism), and f is called an epimorphism of groups (or group epimorphism) 
provided that f is surjective as a map of sets. If f is bijective as a map of sets, we call 
it an isomorphism of groups (or group isomorphism). When the context is clear, and no 
confusion can arise, we drop the reference to groups and call f simply a monomorphism, 
epimorphism or isomorphism. A homomorphism 


f[:GuHG 


is called an endomorphism of G, and if f is an isomorphism, then it is called an automor- 
phism of G. A field automorphism is an isomorphism of F' satisfying the two properties 


that f(aB) = f(a) f(8) and f(a+ 8) = f(a) + f(8) for alla,@ € F. 
The kernel of f : GH FH is given by 


ker(f) = {g € G: f(g) = 0}. 
Also, the image of G under f is given by 
img(f) = {h © H: f(g) =h for some g € G}. 
If S is a subset of H, then 
f-'(S) ={9 €G: f(g) € S} 


is called the inverse image of S. 
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Remark A.4 The set of all automorphisms of a group G forms a group itself under com- 
position of functions, denoted by Aut(G)—see Lemma 2.1 on page 55 for an application to 
field extensions. For instance, the group of automorphisms of a finite field Fp» is a cyclic 
group of order n. 


Definition A.10 — Homomorphism and Embeddings of Rings 


Let R and S be rings. Then a function f : Rt S is a homomorphism of rings provided 
that for alla,be R 


f(a +6) = f(a) + f(b) and flab) = f(a) f(). 


The homomorphism f is called a monomorphism of rings (or embedding of R into S, or 
an injection of rings, or a one-to-one homomorphism of rings), if f is injective as a map 
of sets. Also, f is called an epimorphism of rings if f is surjective as a map of sets. If f 
is a bijection as a map of sets, then f is called an isomorphism of rings. When f is an 
isomorphism of rings, we say that R and S are isomorphic, and write 


RS. 


As above, we abbreviate to say simply homomorphism, monomorphism, epimorphism, or 
isomorphism, when the context is clear. Also, the kernel and image inherit the group 
structure from a map of additive abelian groups. If ¢ is an isomorphism of the ring R, and 
S is a subring of R, then the isomorphism given by 


s+ ¢(s) for alls € S 


is called the restriction isomorphism of ¢ to S, denoted by 


d|s- (A.5) 


Definition A.11 —  Cosets and Quotient Rings 


If R is a commutative ring with identity and J is an R-ideal, then a coset of I in R is a set, 
for a given r € R, of the form r+ I = {r+a:a€T}. The set 


R/T ={r+I:reR} 
becomes a ring under addition and multiplication of cosets given by 
(r+ I)(s+1) =rs+Tand (r+J)+(s+l)=(r+s)+J forr,sER 


which is independent of the choices of r,s. Then R/T is called the quotient ring or the 
factor ring of R by I. It is also referenced as the residue classes of R modulo I. 
A mapping 

f:ROR/I, 


which takes elements of R to their coset representatives in R/J, is called the natural map 
of R to R/T, and this is easily seen to be an epimorphism. In this case, the cardinality of 
R/T is denoted by |R: J]. 


If M and N are modules over a ring R, then a function 


f:MbHN 
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is an R-module homomorphism provided that for all mj1,m2 € MandreR 


f(mi + mz) = f(m1) + f(m2) and f(rm1) = rf(m1). 


If R is a division ring, then an R-module homorphism is called a linear transformation. 


Since an R-module homomorphism is necessarily a homomorphism of additive abelian 
groups, then the same terminology is carried over to f as an R-module monomorphism, 
epimorphism, or isomorphism, provided that f is injective, surjective, or bijective (re- 
spectively), as a map of sets. Hence, the kernel (respectively image), of f is its kernel 
(respectively image), as a homomorphism of abelian groups. 


If R is a commutative ring with identity, then an R-algebra is a ring A such that 
A.25. Ais an R-module, and 
A.26. r(ab) = (ra)b = a(rb) for allr € Rand a,bé A. 


Any R-algebra that is (as a ring) a division ring, is called a division algebra. An algebra 
over a field K is called a finite dimensional algebra over K. A homomorphism (respectively 
momomorphism, epimorphism, or isomorphism), of R-algebras 


f:AvB 


is a ring homomorphism (respectively momomorphism, epimorphism, or isomorphism), that 
is also an R-module homomorphism, (respectively momomorphism, epimorphism, or iso- 
morphism). Also, as in the ring case, the notion of kernel and image of f are inherited 
from the group structure. 


Fundamental results concerning isomorphisms will be needed in the text. The following is 
a fundamental result on isomorphisms of which (A.3) on page 325 is an application. 


Theorem A.5 — Fundamental Isomorphism Theorem for Rings’? 
If R and S are commutative rings with identity, and 
@:RHS 

is a homomorphism of rings, then 

R/ ker($) © ime(4). 
+ Rings of Quotients 
In this section, we look at a generalization of the construction of the rational number field. 
Definition A.12 — Multiplicative Sets 


A nonempty subset S$ of a ring R is called multiplicative provided that 


r,s € S implies that rs € S. 


The classical motivation for the following is to think of the set S of nonzero rational integers. 
This is a multiplicative subset of Z. One may construct Q from the relation on the set Zx S$ 
given by 

(a,b) ~ (c,d) if and only if ad — bc = 0, 


A83This holds for more general rings, but our principal object of study in this text is the ring of integers 
of a number field, so we look only at this case for convenience. See [29] for the more general case. 
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which is an equivalence relation (namely a binary relation R that is reflexive (aRa), sym- 
metric (ab implies bRa), and transitive (aRb and bRc imply aRc)). 


Then Q is the set of equivalence classes, denoted by 


{(a, b)} = a/b 


with addition and multiplication defined in the usual way. These well-defined operations 
make Q into a field, and the mapping z +> z/1 embeds Z in Q. We now generalize this 
setup. 


Theorem A.6 — Ring of Quotients** 


Let S be a multiplicative subset of an integral domain R. Then the relation on R x S 
defined by 
(a,b) ~ (c,d) if and only if ad — bc = 0 


is an equivalence relation. Denote the set of equivalence classes arising from this equivalence 
relation by S~'R. If 0 ¢ S, then S~'R is an integral domain, called the quotient ring of R 
or ring of fractions or ring of quotients of R by S. If S' is the set of all nonzero elements of 
R, then S~'R is a field called the quotient field of R. In the latter case, the map 


w: Rt SR given by r+ rs/s for any s € S$ 


is a monomorphism that embeds R in its quotient field. Thus, ~(s) is a unit in S~'R for 
each s € S. 


+ Polynomials and Polynomial Rings 


If R is a ring, then a polynomial f(a) in an indeterminant x with coefficients in R is an 
infinite formal sum 


[o.e) 
f(t) = Saja? Sag tae te tana" te, 
j=0 


where the coefficients a; are in R for 7 > 0 and a; = 0 for all but a finite number of those 
values of j. If a, # 0, and a; = 0 for 7 > n, then a, is called the leading coefficient of 
f(a). If the leading coefficient a, = 1, then f(x) is said to be monic. The set of all such 
polynomials is denoted by R[x]. 


We may add two polynomials from R[z], f(a) = 0529 aja? and g(x) = Yoo bya, by 
F(x) + o(x) = “(aj + y)a? € Ria], 


and multiply them by 


where 


A-4This setup applies to any commutative ring, but our main concern in this text is rings of integers, which 
are integral domains, so we specialize to that case here. 
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Also, f(x) = g(a) if and only if a; = b; for all j = 0,1,.... Under the above operations 
R{a] is a ring, called the polynomial ring over R in the indeterminant x. Furthermore, if 
R is commutative, then so is R[x], and if R has identity 1p, then 1p is the identity for 
R(x]. Notice that with these conventions, we may write f(x) = )7j_) aja? where ay is the 
leading coefficient since we have tacitly agreed to “ignore” zero terms. 


Note that we could dispense with the indeterminant altogether and write 
f = (0, @1,---,@n,---)- 


Then the above operations would be on these sequences of elements. Note that f(x) is 
not a function and the + in its representation does not represent addition. This is made 
clear by the sequential notation. Thus, the abbreviated notation that we have adopted, 
f(a) = yz aja’, is called the sigma notation, rather than the summation notation. 


If a € R, we write f(a) to represent the element en aja € R, called the substitution of 
a for x. When f(a) = 0, then a is called a root of f(x). The substitution gives rise to a 
mapping f : Rt R given by f : a+ f(a), which is determined by f(x). Thus, f is called 
a polynomial function over R. 


Example A.1 Let R = Z/pZ where p is prime. If f(x) = x? and g(x) = x, then these 
two polynomials of R[x] are distinct. However, f(a) = a? and g(a) = a. However, by 
Fermat’s Little Theorem, a? = a in R. Hence, distinct polynomials can give rise to the 
same polynomial function. (For a detailed discussion of related polynomial congruences and 
the theory behind them, see [50, pp. 105-117]). 


Definition A.13 — Degrees and Division of Polynomials 


If f(x) € Ria], with f(x) = sy a;x/, and aq # 0, then d is called the degree of f(x) over 
R, denoted by degp(f). If no such d exists, we write degp(f) = —o0, in which case f(x) is 
the zero polynomial in R[a|—see Example A.2 on the next page. We say that a polynomial 
g(x) € Ria] divides f(a) € Rix], if there exists an h(x) € Rix] such that f(x) = g(x)h(a). 
We also say that g(x) is a factor of f(a). If F is a field of characteristic zero, then 


degg(f) = deg p(f) 


for any f(a) € Q[a]. In this case, we write deg(f) for degp(f), without loss of generality, 
and call this the degree of f(x). 


+ Polynomial Congruences 


Theorem A.7 — Lagrange’s Theorem 


Suppose that f is an integral polynomial of degree d > 1, and p is a rational prime. Then 
f(x) = 0(mod p) has at most d incongruent solutions. 


If c is the greatest common divisor of the coefficients of f(a) € Z[x], then c is called the 
content of f. If c= 1, then f is called primitiveA® 


A5The content of a polynomial f is also defined more generally when f(x)¢€D[x], where D is a UFD—see 
Definition 1.8 on page 7. The content is not uniquely defined since common divisors are not unique given 
the existence of units. However, any two contents are necessarily associates in D—see Definition 1.5 on 
page 4. In Z, this does not present a problem since the only units are +1, so the gcd (which is positive), 
must be unique. If D is a general UFD, then a polynomial is primitive if the content is a unit in D. 
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Definition A.14 — Irreducible Polynomials over Rings 


A polynomial f(a) € R[z] is called irreducible (over R), if f(a) is not a unit in R and any 
factorization f(#) = g(x)h(x), with g(x), h(x) € R[x] satisfies the property that one of g(x) 
or h(x) is in R, called a constant polynomial. In other words, f(x) cannot be the product 
of two nonconstant polynomials. 


For the following application we remind the reader that a finite field, denoted by Fy with 
q € N elements must satisfy the property that q is a prime power. Such fields are also called 
Galois fields. If q = p™ for a prime p and m €N, then F,, is called the prime subfield of Fy. 
In general, a prime subfield is a field having no proper subfields, so Q is the prime subfield 
of any field of characteristic 0, and F;, is the prime subfield of any field of characteristic p. 


Theorem A.8 — Multiplicative Subgroups of Fields 


If F is a field and F™ is a finite subgroup of the multiplicative group of nonzero elements 
on F’, then F™ is cyclic. In particular, if F = Fp» is a finite field, then F* is a finite cyclic 
group. 


In general, it is important to make the distinction between degrees of a polynomial over 
various rings, since the base ring under consideration may alter the makeup of the polyno- 
mial. 


Example A.2 The polynomial 
f(z) = 22? +22 +2 


is of degree two over Q. However, over F2, the finite field of two elements, degp, = —0o, 
since f is the zero polynomial in F2[z]. 


Some facts concerning irreducible polynomials will be needed in the text as follows—see 
Exercise 2.15 on page 64, for instance. 


Theorem A.9 — Irreducible Polynomials over Finite Fields 


The product of all monic irreducible polynomials over a finite field F whose degrees divide 
a given n €N is equal to 2%” — 2. 


Based upon Theorem A.9, the next result may be used as an algorithm for irreducibility 
over prime fields and thereby generate irreducible polynomials. First, we need a definition. 


Definition A.15 — The GCD of Polynomials 


If fi(x) € Fax] for 7 = 1,2, where F is a field, then the greatest common divisor of fi (x) 
and f(a) is a unique monic polynomial g(x) € F'[z] satisfying both: 


(a) For i= 1,2, g(x)|f;(2). 
(b) If there is a g(x) € F[z] such that g:(x)|fi(x) for 7 = 1,2, then g(x) |\g(z). 


If g = 1, we say that f(x) and fo(x) are relatively prime, denoted by 


ged (fi (x), fo(x)) = 1. 
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Corollary A.8 The following are equivalent. 


(a) A polynomial f is irreducible over F;, where p is prime and degg (f) =n. 
(b) For all natural numbers 7 < |n/2]|, gcd( f(x), a?’ — eS, 
There is a general result concerning irreducible polynomials over any field. 


Theorem A.10 — Irreducible Polynomials over Arbitrary Fields 


Let F be a field and f(x) € Fa]. Denote by (f()) the principal ideal in Fa] generated 
by f(x). Then the following are equivalent. 


(a) f is irreducible over F’. 


(b) F[z]/(f(x)) is a field. 


Our main concern in this text is with subfields of C. In particular, what is the relationship 
between degg(f), and degz(f)? This is answered by an important result of Gauss, which 
relates degrees, and irreducibility of polynomials in Q and Z. 


Lemma A.1 — Gauss’s Lemma*-® 


If f(x) € Za], and 
f(x) = g(@)h(w) for g(x), h(x) € Q{a}, 


then 
f(x) = G(x)H(x) for some G(x), H(x) € Z[a]. 


Furthermore, degg(g) = degz(G), and degg(h) = degz(H). 


Lemma A.1 tells us that any polynomial which is irreducible in Z[z] is also irreducible in 
Q[zx], or contrapositively, if f(x) is reducible in Q[z], then it is already reducible in Z[z]. 
Given this fact, it is useful to have an irreducibility test over Q. 


Theorem A.11 — Schénemann/Eisenstein Criterion*” 

Let f(x) € Z[a] with f(x) = ar a;x/. If there exists a prime p € Z such that both 
(a) aj =0(mod p) for 7 = 0,1,...,d—1 with ag  0(mod p), and 
(b) ao # O(mod p*) 

hold, then f(a) is irreducible in Q[z]. 


Now that we have the notion of irreducibility for polynomials, we may state a unique 
factorization result for polynomials over fields. 


4-6 Another lemma, also known as Gauss’s Lemma, says that the product of primitive polynomials in Z[z] 
is primitive in Z[z]. 

4-7 Although this is known as Eisenstein’s criterion in the literature, it was actually first discovered by 
T. Schénemann in [64]. He actually claimed priority over Eisenstein in [65]. The consensus is that 
Schénemann’s paper was overlooked because he put the criterion at the end of the paper without any 
applications or even a hint as to its importance, whereas Eisenstein put his at the front of the paper and 
demonstrated the applicability to such things as the irreducibility of the cyclotomic polynomials. 
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Theorem A.12 — Unique Factorization for Polynomials 


If F is a field, then every nonconstant polynomial f(a) € Fa] can be factored in F'[2] into 
a product of irreducible polynomials p(x), each of which is unique up to order and units 
(nonzero constant polynomials) in F’. 


The Euclidean Algorithm applies to polynomials in a way that allows us to talk about 
common divisors of polynomials in a fashion similar to that for integers. 


There is also a Euclidean result for polynomials over a field. 


Theorem A.13 — Euclidean Algorithm for Polynomials 
If f(x), g(x) € Fa], where F is a field, and g(x) # 0, there exist unique q(x), r(x) € Fla] 
such that 

f(x) = a(a)g(a) + r(@), 
where either 0 < deg(r) < deg(g), or r(x) = 0, the zero polynomial with deg(r) = —oo. 
Thus, F[2] is a Euclidean domain with respect to the valuation v(f) = 2%), with deg(0) = 
—oo, namely 248() = 9, 
Finally, if f(v) and g(x) are relatively prime, there exist s(z),t(x) € F'[a] such that 

1 = s(x) f(x) + t(x)g(@). 
An important concept that we will need, for instance, in the proof of Theorem 3.17 on 


page 127, is the following. 


Definition A.16 — Symmetric Functions 


Let R be a commutative ring with identity, and f(x) € Riw1,22,...,%n], the polynomial 
ring in n indeterminates x; for 7 = 1,2,...,n. Then f is called symmetric if for each 
a € S,, the symmetric group on n letters, 


Poles; T2,++- ,&n) = f(®o(1)1 Zo(2); aed ats )s 
denoted by simply f? = f. In particular, if 
8; (#1, 2, Ls En) € Rlx1, 22, oo , Ln] 


is defined to be the sum of all possible distinct products of j distinct 2;, then s; is a 
symmetric function called an elementary symmetric polynomial. Thus, 


n 
$1(@1,%2,.-.,%n) = Sy 
j=l 


82(@1,£2,...,2n) = S- TAX 5; 


1<i<j<n 
$3(X1,02,---,2¢n) = ) LjL iL; 
1<i<j<k<n 
8x,(21, £2;+++,2n) = 5 Ti, Vig Lins 


1<t1 <tg<-<igpcn 


n 
Sri @i4 2) “ , ta) = [[ - 


j=l 
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Theorem A.14 — Newton 


Let R be a commutative ring with identity. Then every symmetric polynomial in 
Rix, @2,.--, Ln] 


is expressible as a polynomial in 
Ris, S2,-++5 Sn]. 


Corollary A.9 Let R be acommutative ring with identity and f(x) € R[x] be a polynomial 
of degree d with roots a1,Q2,...,@q in R. If g(a1,72,...,va) is a symmetric polynomial 
over R, then 

g(a1,Q2,...,Q@a) € R. 


Definition A.17 — Splitting Fields for Polynomials 


If f(x) € F[x] where F C C is a field and a; for 1 < j < dare all of the roots of f(a) in C, 
then there exists a smallest extension field & of F' such that 


f(@) = (@— a1)(@ — a2) ++ (@ — aa) € Ele] 


called the splitting field of f over F'. Moreover, EF = F(a1,@2,...,Qa). 


Theorem A.15 — Extensions of Isomorphisms 


Let K, and Ko be extension fields of F' which has characteristic zero or is finite. Suppose 
that f;(2) € K;[z] with splitting field E,; for 7 = 1,2. If ¢ is an isomorphism of kK, and K»2 
with ¢(f1(x)) = fo(x), then ¢ can be extended to an isomorphism of E) to E2. 


Corollary A.10 If F is a field of characteristic zero or is finite with f(x) € Fla], then 
there exists a splitting field of f(a) which is unique up to F-isomorphism. In particular, 
any two algebraic closures are F'-isomorphic—see Definition 1.31 on page 37. 


Related to the above are the following fundamental facts. See Exercise 2.6 on page 63 for 
comparison and usage of notions surrounding these concepts. 


Theorem A.16 — The Number of Extensions of Isomorphims 


If & is an extension field of F' of finite degree |K : F'| = n, where F' has characteristic 
zero or is finite, and if LU is an algebraically closed field containing F’, then there are n 
F-isomorphisms of K into L. 


Theorem A.17 — The Primitive Element Theorem 


If K is an extension field of fF which has characteristic zero or is finite, then there exists 
a € K such that kK = F(a). 


Theorem A.18 — The Fundamental Theorem of Algebra 
If f(x) € C[a] and deg(f) =d EN, then f(x) factors into a product of d factors in C[a]. 
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+ Basic Matrix Theory 


If m,n € N, then an m x n matrix (read “m by n matrix”) is a rectangular array of entries 
with m rows and n columns. We will assume that the entries come from a commutative 
ring with identity R. If A is such a matrix, and a;,; denotes the entry in the i*” row and 
j* column, then 


Q1,1 a1,2 “Ain 

a2,1 a2,2 ete a2.n 
A= (44,5) = 

Qm,1 Am,2 “Amn 


Two m x n matrices A = (a;,;), and B = (b;,;) are equal if and only if a;,; = b;,; for all i 
and j. The matrix (a;,;) is called the transpose of A, denoted by 


At = (a;,:)- 


Addition of two m x n matrices A and B is done in the natural way, 
A+ B= (ai,5) + (01,5) = (aig + bi,5), 


and ifr € R, then rA = r(a;,;) = (ra;,;), called scalar multiplication. A scalar is a quantity 
that has magnitude, but not direction. This term comes from the vector space context, 
which we develop below. The case where the term scalar is used most often in practice is 
when R=R. 


Under the above definition of addition and scalar multiplication, the set of all mx n matrices 
with entries from R, a commutative ring with identity, form an R-module, denoted by 
Mmxn(R). If R is a division ring, then M,,..,(R) is a vector space over R. 


If A = (a;,;) is an m x n matrix and B = (6;,;) is an n x r matrix, then the product of A 
and B is defined as 


AB= (ai,5)(di,;) = (>: cats) 5 with 1 < a < m, and 1 <j < r. 
k=1 


When multiplication is defined, then it is associative, and distributive over addition. If 
m =n, then M;yx»(R) is a ring, with identity given by the n x n matrix: 


lr O 0 
O Ilr 0 

In = fs ) 
0 0 ie 


called the n x n identity matrix, where 1, is the identity of R. 


Another important aspect of matrices that we will need throughout the text is motivated 
by the following. We maintain the assumption that R is a commutative ring with identity. 
Let (a,b), (c,d) € Mix2(R). It is a straightforward exercise for the reader to verify that 
(a,b) and (c,d) are linearly independent vectors in Mj,9(R) if and only if ad — bc 4 0—see 
Definition A.7 on page 324. If we set up these row vectors into a single 2 x 2 matrix 


a b 
A=(0 a): 
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then ad — bc is called the determinant of A, denoted by det(A). More generally, we may 
define the determinant of any n x n matrix in Myx»(R) for any n € N. The determinant of 
any r € Mi x1(R) is just det(r) = r. Thus, we have the definitions for n = 1,2, and we may 
now give the general definition inductively. The definition of the determinant of a 3 x 3 
matrix 


Q11 41,2 41,3 
A= a21 42,2 42,3 
43,1 43,2 43,3 


is defined in terms of the above definition of the determinant of a 2 x 2 matrix, namely 
det(A) is given by 


43,2 43,3 43,1 43,3 43,1 43,2 


Therefore, we may inductively define the determinant of any n x nm matrix in this fashion. 
Assume that we have defined the determinant of an n x n matrix. Then we define the 
determinant of an (n + 1) x (n+ 1) matrix A = (a;,;) as follows. First, we let A;,; denote 
the n x n matrix obtained from A by deleting the i” row and j*” column. Then we define 
the minor of A;,; at position (i,j) to be det(A;,;). The cofactor of A;,; is defined to be 


cof (Ajj) = (—1)'*7 det(Aj,;). 
We may now define the determinant of A by 
det(A) = a;,1cof (Aji) + a;,2cof (Aj2) pe eseect Qi,n+1C0f (Ajn41)- (A.6) 


This is called the expansion of a determinant by cofactors along the i*” row of A. Similarly, 
we may expand along a column of A. 


det(A) = 1,5 cof( A, ,;) + a2,5 cof( A2,;) Sa Qn+1,j cof(An+41,5); 


called the cofactor expansion along the j*” column of A. Hence, a determinant may be 
viewed as a function that assigns a real number to an n x n matrix, and the above gives a 
method for finding that number. Other useful properties of determinants that we will have 
occasion to use in the text are given in the following. 


Theorem A.19 — Properties of Determinants 


Let R be a commutative ring with identity and let A = (a;,;), B = (b,;) € Mnxn(R). Then 
each of the following hold. 


(a) det(A) = det(a;,;) = det(a;,;,) = det(A‘). 
(b) det(AB) = det(A) det(B). 


(c) If matrix A is achieved from matrix B by interchanging two rows (or two columns), 
then det(A) = — det(B). 


(d) IfS, is the symmetric group on n letters, then 


det(A) = be (sgn(o))@1,0(1)@2,0(2) "*°An,a(n)s 


TEBn 


where sgn(c) is the sign of o given in Definition A.2 on page 320. 
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If A € Myxn(R), then A is said to be invertible, or nonsingular if there is a unique matrix 
denoted by 
A € Mya (R) 


such that 
AA!'=1,=A71A. 


Theorem A.20 — Properties of Invertible Matrices 

Let R be a commutative ring with identity, n € N, and A invertible in M,,,(R). Then 
each of the following holds. 

(a) (AV) T=A. 

(b) (At)~! = (A7)*, where “¢” denotes transpose. 


(c) det(A) is a unit in R. 


There is a special class of invertible matrices, which we will have occasion to use in the 
development of the basics in this text—for instance see Exercise 1.59 on page 54. 


Definition A.18 — General Linear Group and Unimodular Matrices 


If R is a field, or R = Z, then the totality of n x n nonsingular matrices with entries 
from R forms a group under matrix multiplication, called the general linear group, denoted 
by GL,,(R). In the case where R = Z, Theorem A.20 tells us that det(A) = +1 for any 
A€GL,,(Z). The matrices in GL,,(Z) are called unimodular. 


Another important fact is contained in the sequel, a result which follows from cofactor 
expansions—see Biography B.2 on page 351 for some ironies of attribution in this regard. 


Theorem A.21 — Cramer’s Rule 


Let A = (a;,;) be the coefficient matriz of the following system of n linear equations in n 
unknowns: 
Q1,101 7 41,272 Tes' TaA1ntn = by 


a2171 1+ a2,272 T*** Ta2ntIn = be 


An 101 + An ,202 Sees an ntn = bn, 


over a field F. If det(A) 4 0, then the system has a solution given by: 


es iaciy (S-n 2ei(a,)) , <Sj<n). 


We may also determine the inverse of a nonsingular matrix via a notion related to the 
development of Cramer’s Rule. 
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Definition A.19 — Adjoint 
Let R be a commutative ring with identity. If A = (a;,;) € Mnxn(R), then the matrix 
A® = (04,3) 
given by ee 
bij = (—1)'*! det(A;,i) 
is called the adjoint of A. 


Some properties of adjoints related to inverses are as follows. 


Theorem A.22 — Properties of Adjoints 


If R is a commutative ring with identity and A € My y,(R), then each of the following 
holds. 


(a) AA® = det(A)I, = A®A. 
(b) A is invertible in M,,,(R) if and only if det(A) is a unit in R, in which case 
A@ 


ce; as 
ae det(A)’ 


Note that when R is a field in Theorem A.22, then det(A) is a unit if and only if det(A) 4 0. 


The following facts will also prove to be useful in the text—see the proof of Theorem 2.2 
on page 58, for instance. 
Theorem A.23 — Systems of Linear Homogeneous Equations 


A system of m linear equations in n unknowns x; over a field F 


01,101 + 41,2%2 +°+++ 41 nt = by 


Am 1% ate Am,2V2 apse Aamntn = bm 
has a (simultaneous) solution if and only if the matrix equation AX = B has a solution 
X, where A = (a;,;) € Mmxn(F), X = (i) € Mnxi(F), and B = (6;) € Mnxi(F). The 
system AX = B is called a homogeneous system of linear equations if B = (0) € Mnxi(F) 
is the zero vector. If m < n, then AX = 0 has a nontrivial solution, that is to say, one 
for which not all x; = 0. In this case, there are elements c; € F not all zero such that 


Sa ay ji = 0. 
We introduced linear transformations on page 328. We now define an associated matrix. 


Definition A.20 — Matrix of a Transformation 


Suppose that ~ : V+ V is a linear transformation of a vector space V over a field F’. If 
{U1,-.+,Un} is a basis for V over F, then the matrix of w is given by (aj,;), where a;,; € F 
are uniquely determined by 


(vj) = ye 4G U5 (1 << a < n). 
j=l 


The determinant of the linear transformation is denoted by det(i) = det(a;,;). 
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Lastly, for this section on matrices, we define the following, which we will need as a tool in 
the main text—see the proof of Theorem 5.13 on page 215, for instance. 


Definition A.21 Kronecker Products 


If A = (ai,;) € Mexr(F) and B = (84,0) € Msxs(F) for a field F', then the Kronecker 
product of A and B, denoted by A x B, is obtained by taking the matrix B = (6,,¢) and 
replacing each entry 8,,¢ by the r x r matrix Bx,¢(ai,;). 


Now we have a smattering of concepts that we will require and we put them under their 
own headings. 


+ The Arithmetic-Geometric Mean Inequality 


We will have need of the following classical result in the text. See [28, Theorem 5.2, p. 544] 
for a proof. Ifn € N and 2; € Rt for j =1,2,...,n, then 


1/n 


+ Stirling’s Formula 


Pee ee A.7 
n! 27n en 
for some a in the interval (0,1). This is a version that will be suitable for our purposes in 
this text—see the proof of Corollary 3.7 on page 116, for instance. 
Another important fact is from the theory of sets. 
+ Zorn’s Lemma 
Suppose that $' is a linearly ordered“® family of sets that is closed with respect to taking 
unions. In other words, for any number of 8; € S (possibly infinitely many), U,;8,; € S. 
Then S has a maximal element. Zorn’s Lemma is equivalent to the Axiom of Choice (see 
(50, p. 367], for instance). 


Biography A.1 Max Zorn (1906-1993) was born on June 6, 1906 in Germany. 
He received his doctorate from Hamburg in 1930 under the direction of Artin 
(see Biography 1.2 on page 24). He was then appointed to Halle in 1933. 
However, he left Germany because of the Nazis. He worked at Yale from 
1934 to 1936. It was during this period that he produced what we now call 
Zorn’s lemma. He then spent ten years in California, after which he moved to 
Indiana, where he became a Professor. Perhaps his most famous student was 
Israel Nathan Herstein (1923-1988). Zorn did work, not only in set theory, 
but also in topology and algebra. One of his other classical results was the 
proof that the Cayley numbers are unique in the sense that they form the only 
alternative, quadratic, real nonassociative algebra without zero divisors. He 
died on March 9, 1993. 


A-8Recall that a linear order is a binary relation R on a set S such that the following three conditions are 
satisfied. 
(1) aRb, or bRa for all a,beS, with a distinct from 8, 
(2) aRa for no aes, and 
(3) if arb, and bRe, then akc. 
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Biography A.2 James Stirling (1692-1770), was born in May 1692 in Garden 
(near Stirling), Scotland, and educated at Glasgow. In 1717, he published 
his first work Linae Terti Ordinis Neutonianae, extending Newton’s theory of 
plane curves by classifying cubic curves. He was elected to the Royal Society 
in London in 1726, and in 1730, he published, as Example 2 of Proposition 
28, in the Methodus differentialis, the approximation n!~V2rn(2)", in the same 
year as Abraham de Moivre (1657-1705) published his Miscellanea analytica. 
There is a certain consensus among mathematical historians that de Moivre 
knew a version of formula (A.7) earlier than Stirling. De Moivre used many 
such formulas in his research in probability theory. For instance, de Moivre 
was ostensibly the first to work with the probability formula f° e~** de=7/2, 
which appeared in 1733 in a privately printed paper entitled Approximatio ad 
summam terminorum binomi (a +b)” in seriem expansi. In 1735, Stirling 
returned to Scotland, and became manager of the Scotch Mining Company 
at Leadhills. In 1746, he was elected to the Royal Society of Berlin. In that 
same year Colin Maclaurin (1698-1746) died, and Stirling was offered his chair 
at Edinburgh, but he declined. He is also known for numbers called Stirling 
numbers, which have to do with permutations of lists of numbers. Stirling died 
on December 5, 1770 in Edinburgh. 


We also remind the reader of the following elementary, albeit important facts. 


+ Dirichlet’s Box Principle 


If more than n € N objects are placed in n boxes, then at least one of the boxes contains 
more than one element. 


This is also called the Pigeonhole Principle based upon the application of n + 1 pigeons 
flying into n holes. 
+ The Well-Ordering Principle 


Every non-empty subset of N contains a least element. 
It can be shown that the Well-Ordering principle is logically equivalent to following—see 
(53, Exercise 1.3, p. 11]. 


+ The Principle of Mathematical Induction 
Suppose that S C N and both (a) and (b) below hold. 


(a) 1 € 8, and 
(b) Ifm >1landn—1€5, thenne S$. 
Then § =N. 


The following will be useful in text—see the solution of Exercise 6.1 on page 401 for instance. 


Theorem A.24 Solutions of Linear Congruences 


For a,be€ ZandneN, 
ax =b (mod n) (A.8) 


has a solution x € Z if and only if g = gcd(a,n) | b. Furthermore, if such a solution exists, 
then there are exactly g incongruent solutions modulo n and exactly one of these is in the 
least residue system modulo n/g, this being the unique solution to (A.8). 
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The following result was first proved by Leibniz and has a familiar consequence as a special 
case, which is used throughout the text—see the proof of the cubic reciprocity law in 
Theorem 6.1 on page 267, and the proof of the biquadratic reciprocity law in Theorem 6.5 
on page 282, for instance. 


+ The Multinomial Theorem 


Theorem A.25 Let R be a commutative ring with identity, and let m,n € N with m > 1. 
If a,,a2,...,@m € R, then 


nial a? «++ adm 
MA oa! Ay 2D, m 
(ay tag +---+ am) = S- Filjal ara! ’ 
(91,92)--,5m) 
where the sum ranges over all m-tuples (j1,j2,-.-,;jm) of nonnegative integers j; with 


jithate tim =n. 


Corollary A.11 — The Binomial Theorem 
Let R be a commutative ring with identity, a,b € R, and n € N. Then 


a n : 
a+b)" = ("Jaton 
(a+b)"=)> j 


j=0 


where 


is the binomial coefficient. 


Biography A.3 Gottfried Wilhelm von Leibniz (1646-1716), was born on 
July 1, 1646 in Leipzig, Saxony (now Germany). By the age of twelve, he 
had taught himself Latin and Greek in order to be able to read the books of 
his father, who was a philosophy professor at Leipzig. Leibniz studied law at 
Leipzig from 1661 to 1666 and ultimately received a doctorate in law from the 
University of Altdorf in 1667. He pursued a career in law at the courts of Mainz 
from 1667 to 1672. Then he went to Paris from 1672 to 1676, during which time 
he studied mathematics and physics under Christian Huygens (1629-1695). In 
1676, he left for Hannover, Hanover (now Germany), where he remained for 
the balance of his life. Leibniz began looking for a uniform and useful notation 
for the calculus in 1673. In 1684, he published the details of the differential 
calculus, the year before Newton published his famed Principia. The bitter 
dispute between Newton and Leibniz concerning priority over the discovery 
of the calculus is detailed in [50, pp. 234-235]. In 1700, Leibniz founded 
the Berlin Academy and was its first president. Then he became increasingly 
reclusive until his death in Hannover on November 14, 1716. 


The following will be of use in the text—see the proof of Theorem 5.8 on page 201, for 
instance. 
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+ The Lagrange Interpolation Formula 


Theorem A.26 Let F be a field, and let a; for 7 = 0,1,2,...,n be distinct elements of 
F. Ife; for 7 =0,1,2,...,n are any elements of F’, then 


is the unique polynomial in F'[z] such that f(a;) =, for all 7 =0,1,...,n. 


For ease of reference and convenience, the reader is reminded of the following definitions. 
+ Some Elementary Number Theory 


We remind the reader of the Definition of Euer’s totient for convenience. 


Definition A.22 — Euler’s Totient 


For any n € N the Euler totient, also known as Euler’s d-function, ¢(n) is defined to be the 
number of m € N such that m <n and gced(m,n) = 1. 


Definition A.23 — The Legendre Symbol 
If c€ Z and p > 2 is prime such that p{c, then the Legendre symbol is given by: 


(<) _ fil if c is a quadratic residue modulo p, 
~ | +1 otherwise. 


Note that the above implies 
(<) = c-Y/2 (mod p). (A.9) 
Also, we have for the c = 2 case that 
Z (p?-1)/8 
—}=(-1)” ; (A.10) 
Pp 
As well, 


CE ee aaa (an 


Definition A.24 — The Jacobi Symbol 


Let n € N, n > 1 be odd, and c € Z with gced(c,n) = 1. Suppose that n = Tj Pi where 
the p; are (not necessarily distinct) primes. Then the Jacobi symbol is 


(¢)-I1(£): 


where the right-hand symbols are Legendre symbols. 
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Definition A.25 — The Kronecker Symbol 
Suppose that n € N andA = is the discriminant of a quadratic number field—see Defin- 
tion 1.33 on page 46. The Kronecker symbol (42) is given by 


(2) 


Ar _ 1 if FL 1 (mod 8), 
2) | -1 if &=5 (mod 8). 


if gced(Apr,n) > 1, and 


A 
(==) is the Legendre symbol for any prime p > 2. 
Pp 


A 
(=) is the Jacobi symbol if n is odd and gcd(n, Ar) = 1. 
n 


If n = 2°m where m is odd, then 


(eal) Ge 


) is the Jacobi symbol. 


where (42 
m 
Also, of value in the text is the following result on representation of natural numbers—see 
(53, Corollary 6.1, p. 245], for instance. 


Theorem A.27 A natural number n can be represented as the sum of two integer squares 
if and only if every prime factor of the form p = —1(mod 4) appears to an even power in 
the canonical prime factorization of n. 


We conclude this appendix with a statement of the following celebrated result. Gauss first 
studied the number of primes less than x, denoted by 7(a). He observed that as x increases 
a(x) behaves akin to 2/log,.(a). Therefore, he conjectured in 1793, at the age of sixteen, 
that the following holds. 


Theorem A.28 — The Prime Number Theorem 


lim mie) 


en et ee | 
ree @/ log, (2) 


% 


denoted by 
T(x) ~ a/ log. (a).4° 


A.9Tn general, if f and g are functions of a real variable x, then f(x) ~ g(x) means limz_+5o f(x)/g(a) = 1. 
Such functions are said to be asymptotic. 
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Although Riemann had given an outline of a proof for Theorem A.28, the necessary tools had 
not yet been developed. This outline was one of the major motivations for the development 
of complex analysis from 1851 until the first proofs were given in 1896 independently by 
Hadamard, and Poussin. 


There are better approximations to m(a) such as the logarithmic integral 


life) =f dt/ 1080), 


which Gauss also conjectured, after postulating the validity of Theorem A.28. 


Biography A.4 Charles Jean Gustave Nicholas De La Vallée Poussin (1866— 
1962) was born in Louvain, Belgium on August 14, 1866. In 1891, he became 
an assistant at the University of Louvain, where he worked with Louis Claude 
Gilbert, one of his former teachers. Gilbert died at the age of twenty-six, and 
Poussin was elected to his chair in 1893. He held that chair for the next fifty 
years. He is perhaps best known for his proof of the Prime Number Theorem 
in 1896, and his important, fundamental textbook Cours d’analyse, which saw 
several editions. However, the text contained no complex function theory. 
Poussin did turn to the theory of complex variables after 1925. He wrote Le 
potential logarithmique, which was published after the war in 1949. He died on 
March 2, 1962 in Louvain. 


Biography A.5 Jacques Salomon Hadamard (1865-1963) was born in Ver- 
sailles, France on December 8, 1865. He studied at the Ecole Normale 
Supérieure, where Emile Picard was one of his teachers. He obtained his doc- 
torate in 1892 on the topic of functions defined by Taylor series. Hadamard was 
elected to a chair at Paris where he discovered his proof of the prime number 
theorem. This proof was only part of his work in complex analysis. He is cred- 
ited with approximately three-hundred publications including contributions to 
the theory of integral functions and singularities of functions represented by 
Taylor series, as well as a generalization of Green’s functions. Hadamard was 
also deeply involved with politics. A relation of his, Alfred Dreyfus, who was 
a French army officer, was tried for treason. This began a controversy that 
lasted over a decade, and became known as the Dreyfus Affair, which scarred 
the history of the French Third Republic. Hadamard actively participated in 
clearing Dreyfus’s name. This occurred on July 22, 1906, when Dreyfus was 
exonorated and decorated with the Legion of Honour—see [52, p. 77] for an 
overview of the Dreyfus scandal and the surrounding issues. Hadamard lost 
two of his sons in World War I after which his politics moved to the left, partly 
in response to the rise of Nazi power. After France fell in 1940, Hadamard 
left for the United States, but returned to Paris in 1944. After World War II, 
he became an active peace campaigner. He died just before his ninety-eighth 
birthday on October 17, 1963 in Paris. 
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Appendix B 


Sequences and Series 


Simplicity is the ultimate sophistication. 


Leonardo da Vinci (1452-1519) 
Florentine painter, sculptor, architect, engineer, inventor 


We look at the important fundamental notions behind sequences and series as they will be 
needed in the main text. The proofs for most of what follows may be found in any standard 
first- or second- year calculus text. 


Definition B.1 — Sequences 


A sequence is a function whose domain is N, with images denoted by a,, called the n*” 
term of the sequence. The entire sequence is denoted by {a,,}°°,, or simply {a,,}, called an 
infinite sequence or simply a sequence. If {a,} is a sequence, and L € R such that 


lim a, = L, 
noo 


then the sequence is said to converge, whereas sequences that have no such limit are said 
to diverge. If the terms of the sequence are nondecreasing, aj, < an+1 for all n € N, or 
nonincreasing, @) > Gn+41 for all n € N, then {a,} is said to be monotonic. A sequence 
{a,,} is called bounded above if there exists an M € R such that a, < M for alln € N. The 
value M is called an upper bound for the sequence. A sequence {a,,} is called bounded below 
if there is an B € R such that B < a, for all n € N, and B is called a lower bound for the 
sequence. A sequence {a,,} is called bounded if it is bounded above and bounded below. 


Some fundamental facts concerning sequences are contained in the following. 


Theorem B.1 — Properties of Sequences 
Let {an} and {b,} be sequences. Then 
(a) If {a,} is bounded and monotonic, then it converges. 


(b) If limp+o Gn = limp +o bn = L ER, and {c,} is a sequence such that there exists an 
N EN with a, < cy < by, for alln > N, then limy +o Cc, = L. 


(c) If limp +o |an| = 0, then limy 5.6 ay = 0. 


Definition B.2 — Infinite Series®-! 


If {a;} is an infinite sequence, then 
Co 
dy 
j=l 


B.1One may trace infinite series back to Archimedes (287-212 B.C.). He established a result on the 
quadrature of the parabola, thereby essentially proving that the series Doped 4-J converges. 
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is called an infinite series. The sum 
n 
Sn = ) a; 
j=l 


is called the n*” partial sum of the series. The series is said to converge if the sequence 
{S,,} converges, and it is said to diverge if the sequence diverges. If the series converges, 
then limp_+o0 Sn = S € R is called the sum of the series, denoted by 


S = Se 
g=1 


An infinite series of the form 
foe) 
yore (a,r € R,a,r 40) 
j=0 


is called a geometric series with ratio r. 


Theorem B.2 — Properties of Infinite Series 
Let )0j2, aj and ))°*, 6; be infinite series. Then each of the following hold. 


(a) If Se a; converges, then the sequence {a;}°, converges to 0. 
(b) Ifc eR is constant, then )77", cay = c 05°, aj. 


(c) If pase ay = S;, € R, and 4 b; = Sp ER, then Dye (4 + b;) = $1, + So. 


Remark B.1 If an infinite series is convergent, then one may remove or insert any finite 
number of terms without affecting its convergence. Also, one may group the terms of the 
series in brackets, without altering the order of the terms, and the resulting series converges 
to the same sum. However, the converse of the last statement is false. In other words, 
one cannot remove brackets and have a series that necessarily converges. For instance, the 
infinite series (1— 1)+(1—1)+--- is convergent, but the series obtained by removing the 
brackets is not. Hence, brackets may be inserted without affecting convergence, but may 
not be removed—see Remark B.2 on page 349. 


We now look at some well-known tests for convergence. 


Theorem B.3 — Integral Test for Convergence 


If f is a positive, continuous, decreasing function of a real variable x > 1 and a; = f(J), 


then te 
S 4; and i} f(a)dx 
j=l ; 


either both converge or both diverge. 
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Theorem B.4 — Convergence of Geometric Series 


The geometric series 
Co 
y ar! 
j=0 


diverges if |r| > 1. If 0 < |r| < 1, the series converges to the sum 


= a 

y ari = : 
; 1l-r 
j=0 


Also, for any r #1, the n*” partial sum of the geometric series is given by 


Sg GEO) 
Sn, = J — 
D ar = 
j=0 
Theorem B.5 — Direct Comparison Test 


Suppose that }5", a; and 57>", by are infinite series such that 0 < aj < 0; for all j € N. 
Then 


Co Co 
y a; converges if y b; converges. 


jet a1 


Note that the contrapositive is: 


S- b; diverges if a a; diverges. 


j=l j=l 


Definition B.3 — Harmonic Series and p-Series 


A series of the form 


1 1 .-t 1 
he 
aad J 
is called a harmonic series. 
Theorem B.6 — Convergence of p-Series 


The series i a converges for p > 1 and diverges for 0 < p< 1. 


Theorem B.7 — Limit Comparison Test 


Let aj > 0, b; > 0 and 
lim () =LeRt. 
n— oo b; 


Then }75-, a; and ))" , 6; both converge, or they both diverge. 
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Definition B.4 — Alternating Series 
An infinite series of the form 


(-1)%a; or $0(-1)*4a;, 


gai j=l 


with a; > 0, is called an alternating series. 


Biography B.1 The fact that the harmonic series diverges was first discov- 
ered by Nicolae Oresme in the fourteenth century. His birth year is unknown 
with any degree of certainty by historians, but most set it at 1323. There is also 
some disagreement as to where he was born, but most data supports that it was 
somewhere near Caen, France, if not in Caen itself. He was a Parisian scholar, 
who studied theology, and became bursar at the University of Paris. Later he 
became canon, then dean of Rouen. In 1370, he was appointed chaplain to King 
Charles V, advising the King on spiritual as well as financial matters. Oresme 
was also the among the first, even before Descarte, to use a coordinate system 
for graphing, incorporating such ideas as velocity-time graphs. He suggested a 
three-dimensional generalization of the concept in his work Tractatus de Figu- 
ratione Potentarium et Mensurarum. The work contained implicit suggestions 
of a four-dimensional geometry, but analytic geometry was not yet developed 
to take him further. In his Algorismus Proportionum, Oresme developed the 
idea of fractional powers. He suggested that irrational exponents such as, in 
modern notation, av? are possible. This was undoubtedly the first appearance 
of the notion of a higher transcendental function. However, he did not have 
enough development of either notation or terminology to take the notion very 
far. Oresme’s ingenious idea for proving the divergence of the harmonic series 
was to group successive terms in the series placing the first term in the first 
group, the next two terms in the second group, the next four in the third group, 
continuing so that the n‘” group has 2"~! terms. He then deduced that since 
there are infinitely many groups with each group having a sum of at least 1/2, 
then adding together enough terms, one can achieve a number larger than any 
given number. Oresme ultimately became the Bishop of Lisieux, France and 
died there on July 11, 1382. 


Theorem B.8 — Alternating Series Test 
The alternating series 


So(-1)'a; and So (-1)'*14,;, with a; > 0 for all j ¢ N 


j=l j=l 


converge if both lim;_,.. aj = 0 and a; > ag > a3 >---. 


Theorem B.9 — Absolute and Conditional Convergence 


If the series 
Co 


S— Ia;| 


jai 
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converges, then )7* , 
other hand, if }°; aj converges, but >>", |a;| diverges, we say that }7°°, a; is condition- 
ally convergent. 


a; converges and we call i a; is absolutely convergent. On the 


Remark B.2 If an infinite series is absolutely convergent, then its terms can be rearranged 
in any order without changing the sum of the series. On the other hand, if a series is 
conditionally convergent, then the series can be rearranged to give a different sum. 


Theorem B.10 — Ratio Test 
If ei a, is an infinite series, with a; 4 0 for all 7 € N, then each of the following holds. 


(a) °°, a; is absolutely convergent if 


j=l 
Tima: | eet 
jg-oco aj 
(b) Soj2, a; diverges if 
lim SFE SS or lim Th) — 
Joo aj Joo a; 
Theorem B.11 — Root Test 


Let ot a; be an infinite series. Then each of the following holds. 
(a) If limjoo ¥/Jaj| < 1, then 577°, a; is absolutely convergent. 
(b) If limjsoo ¥la;| > 1 or limj+o. ¥/la;| = 00, then 95°, a; diverges. 


The simplest and most important of the infinite series are the following, with which we will 
be most concerned in the main text. 


+ Power Series 


Definition B.5 — Power Series 


If x is a real variable, then 
be a;(x — ce) 
j=0 


is called a power series centered at c€ R. 


Theorem B.12 — Convergence of Power Series 


If oF a(x — c)’ is a power series, then exactly one of the following holds. 
(a) The series is absolutely convergent for all x € R. 
(b) The series converges only for x = c. 


(c) There exists an R € R such that the series is absolutely convergent for |x — cl < R, 
and diverges for |a —c| > R. 
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The value R is called the radius of convergence of the series. Thus, in part (a), R = co 
and in part (b), R = 0. In part (c), the real interval (c — R,c+ R) is called the interval of 
convergence of the series. 


Theorem B.13.  — Abel’s Theorem 


If the radius of convergence of the power series 75°) a;x/ is R and 


Co 
RI 
) ajR 
j=0 
is convergent, then 


Co [oe) 
lim y aja) | = ) a,;R?. 
z>R - . 

j=0 j=0 


Corollary B.12 If R= 1 and ear aj; is convergent, then 


foe) [oe) 
lim ) a;zI | = ) a 
al : J = J 

j=0 j=0 


Application B.1 — Hyperbolic Tangent 


Consider the infinite series 1 — ¢? + t4 — t® +--- the sum of which for |t| < 1 is (1+#?)71. 
Integrating termwise for —1 < x < 1, we get 


anes 4 dt x 
arctan(x) = =f 
9 L+#? 3 5 


which implies that 


li t =tan7*(1)=—=1 
lim are an(az) = tan” “(1) ri gts 


which is the formula for 7 discovered by Gregory—see Biography B.2 on the facing page. 
Of particular importance is the following. 
Theorem B.14 — Taylor and Maclaurin Polynomials and Series 


If a function f of a real variable x has derivatives f“ for j = 1,2,...n at c, then 


J 


nr P(e | 
P(e) = Oe — 0) 
j=0 


Taylor polynomial for f atc. If ¢ = 0, then it is called the n*” Maclaurin 


is called the n‘” 
polynomial for f. 


If f has derivatives of all orders at c, then the series 


© A (¢ 


is called the Taylor series for f atc. If c= 0, then the series is called the Maclaurin series 


for f. 
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Biography B.2 Colin Maclaurin (1698-1746) is known today almost exclu- 
sively for the series that bears his name, namely f(x) = De f (0) a /4!, 
where f is the j‘” derivative of the function f—see Theorem B.14. This 
series appeared in his Treatise of Fluxions in 1742. However, it is a special 
case of the more general Taylor series published by the secretary of the Royal 
Society, Brook Taylor (1685-1731) in his Methodus Incrementorum of 1715. 
However, this series was known long before by the Scotsman James Gregory 
(1638-1675), although Taylor was not aware of this. Furthermore, the se- 
ries appeared in Methodus differentialis by Stirling more than a decade before 
Maclaurin’s publication—see Biography A.2 on page 340. There is also evi- 
dence that this series was known to Indian mathematicians such as Kelallur 
Nilakantha Somayaji (1444-1544). It is somewhat ironic that Maclaurin is 
known for the above series, when he had deep results of his own in geometry. 
Maclaurin is considered by many historians to be the most outstanding of the 
generation of British mathematicians after Newton. He was born in Argyll- 
shire, Scotland, and was educated at Glasgow. He was Professor at Marischal 
College, Aberdeen from 1717 to 1725, then at the University of Edinburgh from 
1725 until 1745. In 1740, he shared a prize from the Académie des Sciences, 
with Euler and Daniel Bernoulli, for a study of tides—see Biography 4.7 on 
page 161. The irony of attribution is compounded by the fact that a discov- 
ery made by Maclaurin in 1729 is credited to Gabriel Cramer (1704—1752)—-see 
Theorem A.21 on page 337. Maclaurin was also actively involved in the defense 
of Edinburgh during the Jacobite rebellion of 1745, and fled the city for York 
when it fell to “Bonnie Prince Charlie.” The war in the trenches had taken its 
toll on him however. He died the next year on June 14, 1746 in Edinburgh. 
Maclaurin’s Treatise of Algebra was published posthumously in 1748. 


Definition B.6 — Remainder of a Taylor Polynomial 


If f is a function of a real variable x and P,,(x) is the n*” Taylor polynomial for f at c, then 
R(x) = f(#) — P(x) 
is called the n*” remainder of f(z). 


Lagrange’s form of the remainder of a Taylor polynomial is given in the following—see 
Biography 3.3 on page 93. 


Theorem B.15 — Taylor’s Theorem 


If f is a function such that f% exists for 7 = 1,2,...,n +1 in an interval I containing c, 
then for all x € I, there exists a z between x and c such that 


2 £)(¢ : 
fe) =, Se — 0 + Raa), 


joo 


where 
= jeez) n+l 
R,(«) = Gah! (g—o)", 
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Theorem B.16 — Convergence of a Taylor Series 
Let f be a function having derivatives of all orders in an open interval J centered at c. Then 


f(@) = 


for all z € I. 


The following formulas will be very useful throughout the main text. 
+ — Power Series for some Elementary Functions 


In what follows, the interval for x given for each series is the interval of convergence. 


B.l. £ = 7%,(-1)4(@ — 1)7 = 1- (w@ — 1) + (w— 1)? —--- with x € (0,2). 
B.2. log(1 +2) =e, GY eg fy et _... with @ € (-1,1). 


B3. e=° f=lt+e+ H+ Ht withweR. 


we : 
B.4. sin(a#) = pe, ae = 2 oa eer oe with oe Ri 

B.5. cos(z) = 2, SY =1- 8 +4... witha eR. 

B.6. eM) =14+ 24% — Set _ Soh _ Be 5 S60 L . with 2 ER. 


The following notion brings series into the realm of the complex numbers C and allows us 
to introduce important fundamental results to be used in the main text. For instance, we 
presented the connection with Bernoulli numbers in Theorem 4.5 on page 156. 


Definition B.7 — Dirichlet Series—The Riemann Zeta Function 


Dirichlet series is an infinite series of the form 
CO 
—sxr - 
) aje ae 
j=1 


where a;,s € C, A; € R, and the sequence {A,;} tends monotonically to infinity. In partic- 
ular, if A; = log(j), and a; = 1 for all 7 € N, then 


¢(s) = re 


is called the Riemann zeta function—see page 155. 
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Biography B.3 Georg Friedrich Bernhard Riemann (1826-1866) was born 
on September 17, 1826 in Breselenz, Hanover (now Germany). He was the 
son of a Lutheran pastor, and his family was relatively poor. Moreover, he was 
physically frail, but what he lacked in physical strength, he more than made up 
in intellectual acuity. Furthermore, his lack of financial wealth did not prevent 
him from getting a strong education. In 1846, he studied under Jacobi, Dirich- 
let, and Eisenstein at Berlin, went to Gottingen in 1849 to study under Gauss, 
and achieved his Ph.D. in 1851. In 1854, he became Privatdozent at the Univer- 
sity of Gottingen. His Habilitationschrift or inaugural dissertation was given on 
his thesis entitled Uber die Hypothesen welche der Geometrie zu Grunde liegen 
or On the hypotheses which lie at the foundation of geometry. This presented 
such a deep general perception of geometry that its results ultimately made way 
for Einstein’s theory of general relativity, since Riemann proposed the general 
study of curved metric spaces, rather than geometry on a sphere. His ultimate 
contributions to theoretical physics were deep, and long-lasting. In 1859, after 
the death of Dirichlet, Riemann was appointed to fill his chair at Gottingen. 
Riemann’s name is attached to a host of mathematical objects and theorems 
including the Riemann integral, the Riemann surface, Riemannian geometry, 
the Riemann mapping theorem, Riemann manifolds, and the still unresolved 
Riemann hypothesis—see Hypothesis B.1 on the following page—to mention a 
few. Riemann married at the age of thirty-six in 1862. The following month he 
became ill with pleurisy, which ultimately turned into pulmonary tuberculosis. 
He travelled to Italy several times to enjoy the milder climate and recover. On 
his final trip, he went to a villa at Selasca, Lake Maggiore in Italy. He died 
with his wife by his side on July 20, 1866. 


The Riemann zeta function converges for #(s) > 1. If R(s) > 1, then the following is called 
Euler’s identity 


¢(s)=[Ja-p’y, 
Pp 

where the product runs over all primes p. Although we shall not explicitly need the following 
facts in the text, we state them here for the reader with knowledge of complex analysis. The 
function ¢(s) is a holomorphic function in the half plane R(s) > 1, and can be continued 
analytically to a meromorphic function on the whole plane. Its unique singularity is the 
point s = 1 at which it has a simple pole with residue 1. Riemann proved the above in 
1859. 


There is also a classical connection of the zeta function with the following concept. 


Definition B.8 — Gamma Functions 


The gamma function is given by 


where ; ; 


is called Euler’s constant. For #(s) > 0, we may write 


re) = [ ge da. 
0 
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The following application will be useful in text. The Gamma function is used to verify the 
following—see [54, Exercise 10.13, p. 345], for instance. 


Application B.2 —Infinite Product Expansion for Sine 


= i 
sin(a) = x Il (1 - =a) j 
n=1 


The following relationship between the zeta function and the gamma function is called 
the functional equation of the zeta function, proved by Riemann in 1859. 


G(s) = 2x1 sin (SJ s)¢(1—s). 


As a result of the functional equation, it is known that all the nontrivial zeros of ¢(s) 
(namely those for which s 4 —2,—4,—6...) must lie in the critical strip 0 < R(s) < 1, and 
that they are located symmetrically about the critical line R(s) = 1/2. However, heretofore 
nobody has been able to prove Riemann’s contention: 


Hypothesis B.1 — The Riemann Hypothesis 
¢(s) £0 for any s with R(s) > 1/2. 


Proofs of the above results may be found in [54, Theorem 5.15, p. 225] and the discussion 
surrounding it therein. 

Some of the history behind the zeta function is worthy of note. In 1731, Euler had done 
calculations that allowed him to conclude that ¢(2) ~ 1.644934, and later got stronger 
approximations. By 1734, Euler had proved that 


¢(2) = 17/6. 
He had communicated with Daniel Bernoulli on his early successes in 1734. Daniel found 
Euler’s discoveries to be “remarkable.” In 1737, after the successful calculation of ¢(2) under 
his belt, Euler published Variae Observationes circa series infinitas, which contained the 
now-famous Euler product: 
98 .38.55.78. 118... 
¢(s) — Ss s s s s . 
(2° — 1)(3* — 1)(5§ — 1)(78 — 1)(118 — 1)--- 


Also contained in that paper, he established (in modern notation) that as x — co, then 


SF = logloe(«) + C4 o( : i; 


mee log(x) 


where C is a constant.?? By 1740 Euler had determined ¢(2n) for any n € N—see The- 
orem 4.5 on page 156. However, for odd integers n, ¢(n) remains a mystery. In attempts 
to solve the problem, Euler was led to the following result, which he published in a paper 
called Exercitationes Analyticae in 1772, 


ee 7 ae 
1+ 33 + 53 prs A. log(2) + 2 f x log(sin(x))dx. 

What is all the more amazing is that Euler was blind by this time and all the calculations 

were done mentally. For further details on the life of Leonard Euler—see Biography 4.4 on 

page 148. 


B.2We remind the reader that the big O notation is defined by f(x) = O(g(x)), for positive real-valued 
functions f and g, provided there exists an r € R such that f(x) < rg(a). 
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The Greek Alphabet 


Timeo Danaos et dona ferentes: I fear the Greeks though bearing gifts. 
Virgil (Publius Vergilius Maro) (70-19 B.C.) 
Roman poet 


Capital 


Lower-case 


English transliteration 


Pronunciation 


ZexmeByMytoOmees Rr ORnypone 


ERK OHECAQDAOCMUE HARB BINA GWE. 


oZgeBwrunntorx phere fanoan oe 


alpha 
beta 
gamma 
delta 
epsilon 
zeta 
eta 
theta 
iota 
kappa 
lambda 
mu as in mew 
nu as in new 
xi as in ksee 
omicron 
pi as in pie 
rho as in row 
sigma 
tau as in towel 
upsilon 
phi as in fee 
chi as in cheye 
psi as in psee 
omega 


The é denotes a long e as in see, as opposed to the short e as in bed. The symbol 0 is 
used here to mean an o as in boring, somewhat longer than the o in omicron, but not as 
long as the long o in too. The pronunciations given here are those used by English-speaking 
people. The Greeks have (sometimes) different pronunciations for the letters. For instance, 
the Greeks pronunciation of @ is the same as given above, but the Greek pronunciation of 
B is vita. Thus, the difference between the conventional ones, given in the column above, 


may vary from the real ones used by the Greeks themselves. 


C-14T am the Alpha and the Omega, the first and the last, the beginning and the end.” (Revelation 22:VII) 


of the Holy Bible. 
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Latin Phrases 


amicus certus in re incerta cernitur: a true friend is certain when certainty is uncer- 
tain—i.e., a friend in need is a friend indeed. 

Latin proverb spoken by Ennius (239-169 B.C.) 

Roman writer 


Latin Phrase 


English equivalent 


abeunt studia in mores 

ab uno disce omnes 

ad arbitrium 

ad extremum 

ad hoc 

ad infinitum 
ad libitum (ad lib) 

aere perennius 

a fortiori 


alea jacta est 
a maximis ad minima 
animis opibusque parati 
arrectis auribus 
aurea mediocritas 
bonis avibus 
cadit quaestio 


cetera desunt 
cogito ergo sum 
divide et impera 
docendo discimus 
ecce signum 
e contrario 
exempli gratia (e.g.) 
e pluribus unum 
et sic de similibus 
excelsior 
exceptis excipiendis 
ex necessitate rei 
ex nihilo nihil fit 
ex vi termini 
facile princeps 
finem respice 


practices, zealously pursued, pass into habits 
from one, learn to know all 
at will (arbitrarily) 
to the extreme (at last) 
to this (for a particular purpose) 
without limit (to infinity) 
improvise 
more lasting than bronze 
from the stronger (argument) 
meaning: with greater reason — 
used in drawing a conclusion that 
is deemed to be even more certain than another 
the die is cast 
from the greatest to the least 
prepared in mind and resources 
with pricked-up ears (attentively) 
the golden mean 
under good auspices 
the question drops, 
meaning the argument fails 
the rest is missing 
I think, therefore I am (exist) 
divide and rule 
we learn by teaching 
behold the sign (look at the proof) 
on the contrary 
for example 
one out of many 
and so of like things 
still higher 
with the necessary exceptions 
from the necessity of the case 
from nothing comes nothing 
from the force of the term 
easily first 
consider the end 
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Latin Phrase 


English equivalent 


finis coronat opus 
hoc opus 
id est (i.e.) 
in aeternum 
in dubio 
in vino veritas 
in vivo 


januis clausis 
lapsus calami 
lapsus linguae 
littera scripta manet 
locus in quo 
magna est veritas et praevalebit 
mirabile visu 
multum in parvo 
mutatis mutandis 
ne quid nimis 
non sequitur 


nosce te ipsum 
nugae 
obscurum per obscurius 


onus probandi 
si vis pacem para bellum 
sic 

sine qua non 

status quo 
suo loco 

tempus fugit 

uno animo 
vincit omnia veritas 


the end crowns the work 
this is the hard work 
that is 
forever 
in doubt (undetermined) 
there is truth in wine 
in a living thing, 
or in the body of a work 
behind closed doors 
slip of the pen 
slip of the tongue 
the written letter abides 
place in which 
truth is mighty and will prevail 
wonderful to behold 
much in little 
with necessary changes made 
nothing in excess 
a conclusion that does not 
logically follow from the premises 
know thyself 
trifles 
(explaining) the obscure 
by the more obscure 
burden of proof 
if you wish peace, prepare for war 
so, thus?-1 
an indispensable condition 
state in which (the existing state) 
in its proper place 
time flies 
with one mind (unanimously) 
truth conquers all things 


D--lIn the current vernacular, this is used to mean “You scratch my back and I’ll scratch yours.” 
D.0This refers to a method of proof, which assumes the contrary of a hypothesis to be proved, and deduces 


an absurd consequence. 
D.1This is used to mean intentionally so written. Sic is used after a quote, calling attention to it, in order 


to indicate that it really does reproduce the original, or in the current vernacular, “Yes, they really did say 
that.” 
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Section 1.1 


1.1 


1.3 


1.5 


1.7 


Since Z[(1 + /n)/2] C Q(/n), then if it is a ring, it is an integral domain. Thus, by 
Remark A.2 on page 323 it suffices to show that it is closed under subtraction and 
multiplication. Closure under subtraction is easy to see, since 
1 1 n 1 
a+b gael ey =l(a—c+(b-d) gen ; 
2 2 2 
Also, since n = 1(mod 4) and 
1 1 -1 1 
iN oc Ve tie eel eee pay 
2 2 4 2 
there is closure under multiplication. For Z[,/n] there is no need to restrict to n = 1 
(mod 4) since we are dealing only with elements of the form a+ b,/n with a,b € Z. 
Hence, the above argument works in the same fashion to show it is an integral domain 
as well. 
If a € Uzj.,,], there exist a,8 € Zlw,] =D such that a8 = 1. So N(aB) = N(1) = 1. 
But since N(a’) = N(a)N(@) by Exercise 1.2, then N(a) = £1. Conversely, if 
N(a) = +1, then a = a+ b/n where 2a, 2b € Z, and hence a? — b?n = +1. Thus, 
B=a-—b/n€ D and a8 = +1, so a € Up. 
If a = $162 for B; € Z[/n], then B; = a; + b;./n with a;,b; € Z, j = 1,2. Since 
1 Bz = ajay + bybon + (ary + bya2)Vn, 
then 
p= |(a1a2 + by bon)? — (abe + bya2)?n| = | (aj = bin) (a3 = b5n)| ; 
so 


|a3 = be n| = 1 for one of j = 1,2. 


In other words, one of 6; for 7 = 1,2 is a unit in Z[,/n], by Exercise 1.3, so a is 
irreducible in Z[,/n]. 


The converse fails. For instance 2 = 2+ 0/10 = a + bv10 is irreducible in Z[V10], 
but a? — b?n = 4 in this case. 


Let Lp denote the set of units in an integral domain D. Then by +1p € Up. Also, 
given a,0 € Up, there exist a,, 0, € D such that 


aa,=I1p (S1) 


and 88, = 1p, so af(a,6,) = 1p, namely af € Up, proving that Up is closed 
under multiplication. Furthermore, Sp inherits the properties of associativity and 
commutativity from the integral domain D. Moreover, if a € Up, then by (S1), 
a, € Up is a multiplicative inverse of a. Hence Up is a multiplicative abelian group. 
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1.17 
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If a is irreducible and a nonunit 6 | a, there is a y € D such that a = by. However, 
since q@ is irreducible, y must be a unit, soa ~ (. Conversely, if the only divisors 
of @ are associates and units, any factorization a = By must be trivial. Thus, a is 
irreducible. 


It is false. If a = 4+ V10 and 6 = 4— V10, then N(a) = N(8) = 6. However, if 
4+/10 = (a+bvV10)(4— V/10), then 4 = 4a — 10b and 1 = 4b— a. However, plugging 
a = 4b — 1 into 4 = 4a — 100, we get that 6b = 8, a contradiction. Hence, a % £. 


Let a = 6 = 2-3 = (6+ V30)(6 — 30), where 2,3, (6+ V30) are irreducible, but not 
associates of one another. 


Since u = a + bi is a unit if and only if N(u) = +1 by Exercise 1.3, then a? + b? = 1. 
Hence, (a,b) € {(0, +1), (+1,0)} implying that the units in the Gaussian integers are 
given by 


Uzi = {+1, £2}. 
If y and 6 are gcds of a and £, then by (b) of the definition y | 6 and 6 | 7, so by 
Definition 1.5 on page 4, y ~ 6. 


The ring D = 2Z has 2 € D, but 2 has no divisors in D. Hence, 2,4 € D have no 
greatest common divisor. 


Section 1.2 


1.19 


1.21 
1.23 


Let a, € D be nonzero elements and set 
8={yE D:y7y=ca+ 68, for some o,d € D}. 


Since lpa+0 € § and 0+4+1p8 € §, then § consists of more than just the zero element. 
If f is the Euclidean function on D, we may choose an element yo = 09a + O98 € S 
with f(%) as a minimum. Now let y = ca+68 € 8 be arbitrary. By condition (b) of 
Euclidean domains in Definition 1.9, there are g,r € D such that 


Y¥=q yo +7, with either r = 0, or f(r) < f(y). 


Since 


r= 7-0 = 90+ 6B — q(a0@ + 508) = (o — gao)a t (5 — 950) 8 € 8, 
then if r £ 0, condition (b) of Euclidean domains tells us that 


f(r) = f((o — qa0)a + (6 — q60)8) < fro), 


a contradiction to the minimality of f(y). Thus, r = 0, and so y = qyo. In other 
words, Yo|7 for all y € 8. In particular yo|a and yo|G. Hence, yo is a common divisor 
of a and £ as required. 


Since 1p | a for all nonzero a € D, then by Exercise 1.20, (1p) < ¢(a). 


It is false. First we show that 3 is prime in D = Z{i]. Since D is a UFD by Corollary 1.1 
on page 13, then being prime is tantamount to being irreducible by Theorem 1.2 on 
page 7. So if 3 = (a+bi)(c+di), and c+di is not a unit, then since N(a+bi) = a? +b? 
and N(3) = 9 (with the impossibility of 3 being a sum of two squares of nonzero 
integers—see Theorem A.27 on page 343) one of c or d equals 3 and a + bi is a unit, 
namely 3 is prime in D. This provides the counterexample since 3 is prime but its 
norm is 9. 
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1.25 


We assume that D is almost Euclidean and prove that every irreducible element in D 
is prime. Suppose that 


$= os Bja; € D: a; € D is irreducible but not prime 
Bj;ED 


In other words, § consists of all finite linear combinations of elements in D which are 
irreducible but not prime. If 8 # @, there is an element a € 8 such that ¢(a) has 
least positive value by the Well-Ordering Principle — see page 340. By property (c) 
of an almost Euclidean function, if @ € & which is irreducible but not prime, and a 
does not divide 3, there exist x,y € D such that 0 < ¢(ax + By) < o(a). However, 
axz + By € § by definition, contradicting the minimality of ¢(a@). Hence, a | 8 for all 
irreducibles 6 € S that are not prime. Hence, a ~ £ for all irreducibles 6 that are not 
prime. 


Since a@ is not prime then by definition there exist 61,82 € D such that a | 8,6 and 
a does not divide 8; for 7 = 1,2. However, there exists a $3 such that 8182 = B3a, 
and by the definition of 8, 6,82 € S. Let 6 be an irreducible such that 6 | By or 
) | Bz. Without loss of generality suppose 6 | 8 ,. Then if 6 is not prime, from the 
above 6 ~ a, so there is a unit 6, with a = 66; and a 62 € D with (2, = 669. Hence, 
By = 67 ‘ado, forcing a | 31, a contradiction. We have shown that any irreducible, 
that divides 6, or 82, must be prime. If 6 | a, then since a is irreducible, a = du for 
a unit u € D. But again, assuming 6 | 6, we deduce that a | 8, as above. Hence, 
6 | 63. Given that 6, and 82 must be factorizable into a product of irreducibles, using 
the same argument as in the proof of Theorem 1.6 on page 13, we have shown that 
all irreducibles that divide 6,82 must divide $3, which implies that a is a unit, a 
contradiction. Hence § = @, which completes the task. 


Section 1.3 


1.27 


1.29 


Suppose that N(a) = a2 +b? = p is prime. If a = Gy for B,y € Zli], then 
Nr(B),Nr(y) € Z, so they both divide p. Hence, one of them, say Npr(G) = 1. 
This means that $ is a unit in Z[]. Thus, any divisor of a is a unit or associate. 
Therefore, @ is irreducible, and hence a prime in Z/i] by Corollary 1.1 on page 13. 
Since N(a) = N(a+ bi) = N(a — bi) = (a+ bi)(a — bi) = p, then p is not a prime 
in Z[i] since it is divisible by both a+ bi and a — bi, neither of which is a unit or an 
associate of p. Indeed, both a + bi and a — bi are primes in Z[i] by Exercise 1.22 on 
page 14. Also, p = 1(mod 4) or p = 2 since a prime is a sum of two squares if and 
only if p #3(mod 4) by Theorem A.27 on page 343. 


Since 0 € R; for all 7 € J, then 0 € jer R;, so jer R; x @. For any a,be jer R;, 
a,b € R; for all 7 € J, soa+b,ab € R; for all such j. Hence, a + b,ab € NjeaR;. 
Thus, the latter is a ring in R. To see that Uje7R; is a subring, we first note that 
it is nonempty since 0 € Uje7R;, as 0 is in every R;. If a,b € Uses R;, then there 
are k,f € g such that a € Ry, bE Re. If k < 2, then Re C Re, soa+be Rez, and 
we have additive closure since a+ b € UjesR;. Lastly, since each R; is closed under 
multiplication, so is U;e3R; and we have the result. 
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Section 1.4 


1.31 


1.33 


1.35 


It is valid. Here is a proof. Let N and M/N be Noetherian R-modules. If 
M,C M2 C::: (S2) 
is an ascending chain of R-submodules of M, for each 7 € N let 
M,;+N={m+N:me M;}. 
Then M; + N is an R-submodule of M/N and M; +N C Mj41+N. Hence, 
M+NCMe4+NC.::: (S3) 


is an ascending chain of submodules of M/N. Since M/N is a Noetherian R-module, 
then (S3) terminates so there exists an n € N such that M; +N C M,, +N for all 
j =n. Since M; 1 N is clearly a submodule of N and M;N.N C Mj419N, then 


M:NNCM.nANC::: (S4) 


is an ascending chain of submodules of N. Since N is a Noetherian R-module, then 
(S4) terminates. Hence, there exists an m € N such that Mj; N = M,N for all 
j>m. If we set N = max{m,n}, then for any j > N, 


M;+ N= Mj4i+ N and Mj ON = Mj NN. 


We want to show that (S2) terminates. Suppose it does not. Then there exists 
an jo € N such that M;, is properly contained in M,,41 for some jp > N. In 
this case we may select an element mj,41 € Mj,41 with mj,41 ¢ M;,. However, 
M41 © Mjj41 +N = Mj, +N, so there exists an m,;, € M;, and n € N such that 
Mjo+1 = Mj, +n. By rewriting, mj,41 — mj, =n € N. Moreover, Mj, € Mj,+41 so 
Mjo41—Mj, € Mj +1, which implies mj.41—mj, € Mj .41NN C M,,, a contradiction. 
We have shown that (S2) terminates, so M is Noetherian as an R-module. 


Let 
QCnc-:-: (S5) 


be an ascending chain of D2-ideals. Since D; is a Noetherian domain and Dp is a 
finitely generated D, module, then by Exercise 1.32, D2 is a Noetherian D,-module. 
Since I; for any 7 € N is a D,-submodule of D2 then ($5) must terminate, so D2 is 
Noetherian. 


If R does not satisfy the DCC, there exists an infinite nonterminating descending 
sequence of ideals {J;}, so there can exist no minimal element in this set. Conversely, 
if R satisfies the DCC, then any nonempty collection 8 of ideals has an element J. If I 
is not minimal, it contains an element J,. If J; is not minimal, it contains an ideal Io, 
and so on. Eventually, due to the DCC, the process terminates, so the set contains a 
minimal element. 
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Section 1.5 


1.37 


1.39 


1.41 


1.43 


1.45 


Since J+ 4. is an ideal, then given a € I,8 € Jyy © H, we have a(6+7) =ab+ay 
IJ +1IH anda(6+y)€l(J +H). 


Since an invertible fractional R-ideal I satisfies II~! = R, there exist a; € I~! and 
b; € I such that 1p = STL, aibi. Thus, if a € I, then a = >", (aaj)b;. Also, aa; € I 
for i= 1,2,...,n and since a; € 17-1 = {6G € F: BI C R}, then J is finitely generated 
as an R-module by the };, for 1 = 1,2,...,n. 


Suppose that J = Rb + Rbo +---+ Rb, where b; = c;/a; € F, for aj,c; € R, with 
a; #0 for j =1,2,...,n. Let a=[]j_, aj. Then a #0 and 


n n-1 
al = Re, |] aj; +--+ + Ren JT a; CR, 
j=2 j=l 


which makes J a fractional R-ideal. 


That (i) implies (ii) is Theorem 1.17 on page 28 and Theorem 1.16 on page 27. That 
(ii) implies (iii) is clear. That (iii) implies (iv) follows from Remark 1.13 on page 26 
and Theorem 1.16 on page 27. That (iv) is equivalent to (v) is Exercise 1.42. We now 
show that (iv) implies (i) to complete the logical circle. By Exercise 1.39, part (A) 
of Definition 1.23 on page 25 is satisfied. If a € F, the quotient field of D, then by 
Exercise 1.40, Ria] is a finitely generated R-module. Thus, by Exercise 1.41, R[a] is 
a fractional R ideal. Accordingly, since R[a]R[a] = Ria], then 


Rlo] = RR{a] = (Rla])~*(Rla]) Rla] = (Rlo])~* (R[a]fa]) = (Rla])~"Rla] = R, 


so a € R, which shows that R is integrally closed in F’. This is part (C) of Definition 
1.23. It remains to show that every nonzero prime D-ideal is maximal. Since we 
have part (A) of Definition 1.23, then by Remark 1.12 on page 26, a prime ideal P is 
contained in a maximal D-ideal M. Thus, by (iv) M is invertible. Hence, M~'P = I 
is a fractional R-ideal and 

M1?CM'1M=R, 


so M~! is an integral R-ideal. Moreover, since 
M(M~'?) = RP =P, 


and P is prime, then by Theorem 1.7 on page 16, either M C P or M~!P C P. If 
M-!? CP, then 


RCM1=M1t1R=M'9P 1 CPP ICR, 


which shows that M~-! = R. However, R= MM~! = MR = M, contradicting that 
M is maximal. Hence, M C P, which means that M = P, which is maximal. 


If J = (0) or J = R = (1), then one element suffices, so assume that (0) C IC R and 
let a € I such that a £ 0,1. Then (a) C J and I | (a) by Corollary 1.7 on page 27. 
Thus, there exists an R-ideal J such that (a) = IJ. 


Let & be the set of distinct prime R-ideals P; for 7 = 1,2,...,n such that either 
ordp(I) £ 0 or ordp(IJ) 4 0, or possibly both. Since J £ R, then 8 4 G. 
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By part (c) of Exercise 1.44, there exists a 8 € F, the quotient field of R such that 
ordp((8)) = ordp(Z) for all prime R-ideals P dividing J. Therefore, for all prime 
R-ideals P | I, 


ord (I) = min(ordp (I), ordp((@))) = min(ordp((3)), ordp((a))) = ordp((a) + (8)), 


by part (b) of Exercise 1.44. Hence, 


as required. 


Let D be an almost Euclidean domain and let ¢ be an almost Euclidean function of 
D. Let I be any nonzero ideal of D and let 


8 = {¢(a): a€ I} 


and let ¢(m) for m € I be a minimal positive value in 8. By part (c) of the definition 
of an almost Euclidean function in Exercise 1.25, given y € J and any x,y € D we 
cannot have 0 < ¢(yx+my) < ¢(m), by the minimality of ¢(m), so we must have 
7 = mq for some q € D. Hence, I = (m) since y was arbitrary. This shows that D is 
a PID. By Theorem 1.12 on page 21, D is Noetherian. 


No, since property 3 of Definition 1.24 on page 26 fails to hold given that there exists 
no integer r such that rJ C Z. 


Section 1.6 


1.51 


Taking the hint, it suffices to prove the result for n = 2, since we may extrapolate by 
induction from this case. 


Consider mMa,Q(x) = ie —aj;), where the a; are all of the conjugates of aj = a 
over Q, and let mga(z) = Tj21(x — 8;), where the £; are all of the conjugates 
of 8; = 6 over Q. Also, a; # a, for any 7 # k, and B; A fy for any i F &, by 
Corollary 1.14 on page 38. Select a g € Q such that q 4 (a — ax)/(G; — B) for any 
k =1,2,...,dq and any j = 1,2,...,dg, and let 


y= a+ BP, (S6) 
with 
f(x) = Ma,a(7 — ax) € Q(y) [a]. 
Since 
f(8) = ma,al(¥ — GB) = Ma,9(@) = 0, 
and 


m,o(8) = 0, 


then £ is a common root of f(x) and mg.o(x). We now show that this is the only 
common root. If there exists ao € C, with o 4 8, such that f(c) = 0 = mz,a(c) = 0, 
then o = 8; for some j > 1. Since 


0 = ma,0(@) = Ma,a(7 — GBi) = f(B;), 
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then there is a k € {1,2,...,dg} such that y — q8; = ay. Thus, by (S6), 


an + 9g8; =y =at gh, 


so 

a—-Ak 

q= , 

B; —B 
contradicting the choice of g. We have shown that (£ is the only ae, root of 
f(x) and mg,q(x). Therefore, by Theorem 1.23 on page 38, mg .Q(4)(x )| £¢ f(x) and 
™B,Q(+)(& ) | mg,q(x). However, since f(a) and mg.o(z) have only one Ae in com- 
mon, — 


deg (™,e(+) (x)) =; 


Thus, mg q(7)(z) = 2+ 6 for some 6 € Q(y). Since mg qry)(8) = 0 = 8 +4, then 
B= —6 € Q(7), soa = y— G8 € Q(y). This shows that Q(a,8) C Q(y). However, 
since y = a+q8 € Q(a,f), Q(y) © Q(a,8). We have completed the proof that 
Q(a,8) = Q(y), as required. 


( 
Clearly, Qi +i)) C Q(i, V2). To see that equality holds, we observe that 


(: + ‘ ae ¢ 
S=4- 5 
v2 : 
( 1+ ‘) 
V2 
is a primitive eighth root of unity, and so is any odd power thereof. Since 


QU, V2) : q|=1=|0 (4): Q. 


so 


then we must have 


ati, v3) = 0 (+). 


Section 1.7 


1.55 Let M be a Zmodule. If r € Z, and m € M, then 


r-m=m+:--M, 
—>J_S—-—“_’ 
: ag 
so the properties of an additive abelian group are inherited from this action. 


Conversely, if M is an additive abelian group, then the addition within the group 
gives the Z-module action as above. 


1.57 We only prove this for o = 1, since the other case is similar. 
Suppose that J is an ideal. Therefore, aD € I, so cla by the minimality of c. 


We have 
VD(b+ceVD) =bVD+cD eI, 
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so c|b. Moreover, since 


(2-vD) @+evd)=" ="? er, 


Cc 


then 
a|(b® — e?D)/c. 


In other words, 
ac|(b? — cD). 


Conversely, assume that J satisfies the conditions. To verify that I is an ideal, we 
need to show that aVD € I and (b+ VD)VD € I. This is a consequence of the 
following identities, the details of which we leave to the reader for verification: 


aVD = ~(b/c)a + (a/c)(b + cVD), 


and 
b/D + cD = —(b? — ?D)/c + b(b+cVD)/c, 


so I is an ideal. 


1.59 If [a,6] = [7,6], there are integers x, 20, y, Yo, Z, 20, W, Wo such that 


a=r2y+ 96,0 =wyt+ 20, 


and 
y=ratyoP,d =woat 20/. 


These two sets of equations translate into two matrix equations as follows. 


where 
x= q 
and 
a 
(3) = (5) 
where 
ce) 
Hence, 


(5) = #065} 


Therefore, X Xo = Jz and hence the determinants of X and Xo are +1, so the 
result follows. 

Conversely, assume that the matrix equation holds as given in the exercise. Then 
clearly 


[a,8] < [7,4]. 
Since the determinant of X is +1, we can multiply both sides of the matrix 
equation by the inverse of X to get that y and 6 are linear combinations of a 
and 8. Thus, 


[7,4] & [a,8]. 
The result is now proved. 
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1.61 Let 
Ji, = (ai, (b; + VA) /2) for i = 1,2 


be Oy-ideals such that Jj Jz C P. Then by the multiplication formulas given 
on page 48, J, J. = (a3, (b3 + VA)/2) where a3 = aja2/g = 0(mod p) with 
g = gcd(a1, a2, (b1 + bz)/2)). If p t az (which means that Jz Z P), then p | a1, 
since p cannot divide g, given that it does not divide ag. Thus, to show that 
J, C P, it remains to show that b; = 2pn + b for some n € Z, by Exercise 1.60. 
Now, by Exercise 1.57, 


b} =A (mod 4a;) and b? = A (mod 4p), 


so b? = b? (mod 4p). Since p is prime, then b; = +b(mod 2p). If 
by =—b (mod 2p), then Jy C P’ = (p, (—b + WA)/2), 


so if (—b + VA)/2 € ?, then J; C P so we are done by Theorem 1.7 on page 16. 
If (-b + VA)/2 ¢ P, then PNP’ = (p), so a3 = 1, and this forces p | 1, a 
contradiction. The remaining case is b; = b(mod 2p), so b; = 2pn + b for some 
n € Z, as required. 


Section 2.1 


2.1 Suppose that 6 is an embedding of F in C with 6(a) = 8. Since 


d—1 


0 = mMa,Q(a@) = S- qo! with gq; € Q, 
j=0 


then 
d-1 d-1 dad 
0 =0(0) =0 So get = 3° g(a) = 
j=0 j=0 j=0 


Thus, 8 = a; for some j = 1,2,...,d. Thus, there are at most d embeddings of F 
in C. Now we show that if 0; is defined by 0;(f(a)) = f(a;) for j = 1,2,...,d, with 
f(x) € F[a], then 6; is indeed an embedding of F in C. To do this, we first show that 6, 
is well-defined. If f(a) = g(a) for f(x), g(x) € Fa], then f(a) — g(x) = h(x) ma,0(2) 
for some h(x) € F[z], so f(a;) — g(a;) = h(a;)ma,o(a;) = 0. Hence, 6;(f(a)) = 
f(a;) = g(a;) = 9;(g(a@)), so 9; is well-defined, and the conjugates of a are the a,;, 
which in turn are precisely the roots of mag(x). Lastly, we demonstrate how the one- 
to-one property follows. Suppose that 0;(f(a)) = @;(g(a)). Then f(a;) = g(a;), so 
as in the above, f(x) — g(x) = h(x)ma,,9(x). Thus, f(a) — g(a) = h(a)ma,,e(a@) = 0 
since 0(a) = aj. 


2.3 By Theorem 1.23 on page 38, mo,9(x) € Z[z], so by Exercise 2.2, far(x) € Z[z]. 
We have shown that the F-conjugates of a are roots of a monic polynomial with 
coefficients in Z, namely they are algebraic integers by Definition 1.28 on page 35. 


2.5 If all the F-conjugates are distinct, then f.,7(x) is a product of distinct linear factors. 
Thus, by Exercise 2.2, we have t = 1 and ma,9(x) = fa,r(x). Therefore, 


|Q(a) : Q| = deg(ma,e(#)) = deg(fa,r(x)) =d=|F: Ql. 
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However, since a € F’', Q(a) C F, so F = Q(a). 
Conversely assume that F’ = Q(a). Then 
deg(ma,9(2)) = |F': Ql =d. 


By Exercise 2.2, this implies that t = 1 and mo,9(%) = foa,r(x). Hence, the F- 
conjugates of a are distinct. 


2.7 Since the minimal polynomial of a over Q is of the form 


Ma(x) = a? + be +c € Za], 
then a? + ba+c=0. Therefore, by the quadratic formula, 
a = (—b+ Vb? — 4ac) /2. 


Since we may remove all square factors from b? — 4c = s?d, and since a € Q(V4d), 


then Q(a) = Q(v4). 


2.9 Since /ni + /n2 € Q(/n1,V/n2), we need only show that \/ni, and \/n2 are in 


2.13 


2.15 


Q(/ni + /nz). Since 
(Vi + Vina)? = my +12 + 2/1 /N2, 
then \/ni,/n2 € Q(\/ni + /nz). Also, since 
my — 2/1 y/nz + nz = (rr — V/n2)” € QV + na), 


then 


(Vii + V2) (mi — 72)” = (m1 — 2) (Vm — 72) € Qi + Vina). 
Therefore, \/ni — \/n2 € Q(,/ni + \/nz). Hence, 


Vins = 5( vin — its + Via + Via) € (Vin + Va), 
and similarly, /n2 € Q(,/ni + \/n2). 


It remains to determine the Galois group. Let oj : \/nj > —,/nj for j = 1,2 with 
o1(./n2) = V/n2 and o2(,/n1) = \/ni. These are distinct Q-automorphisms of K, 
of order 2, and since |K : Q| = 4, Gal(K/Q) = (01) x (02), the Klein four-group, 
namely is the direct product of two distinct cyclic groups of order 2—see Remark A.1 
on page 321. 


This is immediate from Exercise 2.1 since the complex embeddings come in conjugate 
pairs. 


By Fermat’s Little Theorem, we get that (a + 6)?” = a?” + 6?" for all a,6 € F. 
Since 1p +> 1p, then o fixes Fp. 


Since F* has order p” — 1, every nonzero a € F satisfies a?’ ~! = 1p. Therefore, 
every nonzero a € F is a root of x?”~!—1p, so also a root of f(x) = 2(a?"~!-1p) = 
x?” —x € F,|[z]. Since f(0) = 0, then f(x) has p” distinct roots. In other words, f(x) 
splits over F. It remains to establish uniqueness. If K is a splitting field for f(a) 
over F,, then f’(x) = —1 and gcd(f (a), f’(x)) =1. Therefore, by part (b) of Exercise 
2.14, f(x) has p” distinct roots in F. Let 0 : F +> F be given by Exercise 2.13. Thus, 
a € F isa root of f(x) if and only if ¢(a) = a. Hence, the subfield of F having all 
roots of f(a) in F' must in fact be F’, so uniqueness is proved. 
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2.17 From the properties of embeddings, we get 


and 


2.19 Since the embeddings of F' in C are 


6,:V74 V7, and 62: V7 —-V7, 


Tr(a) =a+a'= (4) + (54) =, 


Nr(a) =a-a’ = (1-—7)/4 = —-3/2. 
Therefore, by Theorem 2.5 on page 66, 


then 


and 


Mo(z) = 2? — Tr(a)x + Np(a) = 2? — 2 — 3/2. 

2.21 Suppose that a = (a + b,/p)/c € F with a,b,c € Z, gced(a, b,c) = 1, and Np(a) = 2. 
(The gcd condition may be assumed without loss of generality since we may otherwise 
divide out the common factor.) Then 

ce? Nr(a) = a? — bp = 2c”. 
If c is even, then a and b are both odd by the gcd condition. Thus, 
l=a’?=0l*p=p (mod 8), 
a contradiction. Therefore, c is odd, so b and c must both be odd. Hence, 
2= 2c? =a” (mod p). 


However, this is false since 2 is a quadratic residue modulo a prime p > 2 if and only 
if p = +1(mod 8)—see (A.10) on page 342. 
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Let a = V2+ V3. Then 
Mo(z) = 2* — 102? +1. 


Note that if F = Q(v/2 + V3), then Tr(a) = 0, and Nr(a) = 1. Thus, we are aided 
by Theorem 2.5. 


We have 


p-l p-l p-l 


Tr(1— G) = Di A-G)= D1 DG. 
=1 j= j= 


J 
However, by Example 1.5, ea ¢i = —1. Therefore, 


p-l 


Tr(1—- G) =} )1-(-1) =p-1-(-l) =p, 


j=l 


as required. 


3 
3=F4(o+3)+ $5 (gt 03)” — 5. 
To prove the hint, we invoke Exercise 1.54 on page 43 and Theorem 1.25 on page 40 
to get 
uP —1 


x—1 


= TI (@- @) = me, (a), 


then by differentiating the left and right-hand sides we get, 


px? —pxP-!—aP +1 _ 
(x — 1)? = 2 II (a — Gp). 
1<k<p-1 
j=l 
j#k 


Therefore, if we substitute « = Ce into the left-hand side, we get 


p=pGP > peG-1)_ pee 
(Gane Gate ea 


and for the right-hand side, the substitution yields, 


p-1 
(Cees 
j=l 
j#t 
We have shown that: 
p= t p-1 
Ca I (s7) 
Pp 
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2.31 


and since m¢,(1) = p, then 


I 1-@)=p=|[(@-)). 


Therefore, by ($7), 


= = Po j I: (eieeme eg p—-1 p—2 
11 fl « a 1g ce eget ae ene 
j=l fei j= 

j#t 


This last equation has (p — 1)(p — 2) factors, half of which have i < j, so 


II (G5 - i= a 1)°- We-2)/2 II (iG — Gy, 


1<i<j<p-1 1<i<j<p-1 


which, since p > 2, is equal to 


epee ile Waar 


1<i<j<p-1 
as required. 
By Exercise 2.29, 
disc(™a,Q) (a; — aj)? 
1<i<j<d 
Also, 
dd 
mi,g(t)= >> J] @- ai) 
Jee t=1 
tAj 
Therefore, 
d 
Mea Q(a;) — II (aj 7s Qi), 
i=1 
iAj 
so 


d 
Nr(m4,e(a;)) _ II mg (5) 7 Il (a; — ai) II (a; — ai), 


1<i<j<d 1<j<i<d 
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and since there are d(d — 1)/2 pairs (7,7) with 1 <i <j < d, then the above equals 


(1892 TT (ay — a4)? = (1) dise(ma,9)- 
1<i<j<d 


This completes the proof. 
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Section 2.3 


2.33 We use induction on d. If d= 2, then 


=A2 —- QQ. 


ay a2 


This is the induction step. Now assume that the result holds for all such n x n matrices 
with n < d. If cof(A;,;) denotes the cofactor of the matrix A = (a-"), then by (A.6), 


d 
det(a‘*) = > ai *cof(Ai,;). 
j=l 
By induction hypothesis, the result holds for each A;,;, so the entire result holds. 


2.35 Let F = Q(V2) = Q(a). Then a has minimal polynomial m,,9(a) = x? — 2, so via 
Exercise 2.31, 


dise(ma,q) = (—1)“*- YP Np(m, oa) = (-1)°P-9? (2V2)(—2V2) = 8. 


2.37 We have n = |F: Q| = 8, and d = |Q(a) : Q| = 4. Also from Exercise 2.36, 
To(a)(@) = 0, and Neia)(a) = —5. Therefore, by Theorem 2.5, 


8 
Tr(a) = qi Xa) (a) =0, 


and 
Nr(a) = (Nava) (a))*/4 = (—5)? = 25. 


2.39 Since Op = Zia], then B = {1,a,...,a%'} is an integral basis for F, where |F : 
Q| =d. By Exercise 2.38, disc(B) = disc(ma,g). Hence,A pr = disc(ma,Q). 


2.41 Suppose that M = NON. If r € R, then 
MronN =(NO@Ni)rNN = (NrO@Mr)AN = (NroN)@(NMirnN) = Nroan = Nr. 
2.43 By Theorem 1.24 on page 39, a basis for Q(a) over Q is {1,a,07,...,a¢!}. Let 
Ma,Q(x) = ee seg og e+e tee. 


be the minimal polynomial of a over Q. Since aa’ = Sey bina® = biigiattt =q'tl 
then the matrix B = (b;,;) is given byS! 


b 


0 1 0 0 

0 0 1 0 
B= : 

0 0 0 1 

Co C1 c2 Cd-1 


S1Tm linear algebra B is called the matrix of the transformation a-+a-a*. Also, the trace of a is given by 
Y9e1 53,7, and the norm of a is given by det(B). These are alternative definitions of the norm and trace that 
we are now showing to be equivalent to the ones we gave in Definition 2.4 for the more general number field 
F (see Theorem 2.5 for the relevant connections). 
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By Exercise 2.42, we may form a basis {6),...,8,} for F over Q, where 8; = a! for 
j =0,1,...,d—1. Also, by Theorem 2.5 on page 66, d|n, and since a3; = Dp 4,5 Bj 
then the matrix A = (a;,;) must have determinant 


B 
B 
det(A) = det : : 
B 
where there are n/d blocks of B on the main diagonal. Since 
| det(B)| = |eo|, and | det(A)| = | det(B)|"/%, 


then 
| det(A)| = |co|"/%. 


However, by Theorem 2.5, 
leo] = [Ng(ay(@)|, and (Ng(a)(a))"/? = Ne (a), 


| det(A)| = |Nr(a)}, 


as required. (The reader should compare this with Exercise 2.2 on page 62.) 
Section 2.4 


2.45 Suppose that 
r= [24 
j=l 


for distinct prime 0 -ideals P;. Then since 
N(I) = [J N (Ps) 
j=l 


is prime, we must have r = 1 = ay, so I is prime. 


2.47 Since there are nonzero a,3 € Op and integral D-ideals I, J such that 


1 1 
J=-—Iandgjg=-—J, 
a B 


then ; 
Jd = —I J. 
d ap 
Therefore, by Definition 2.8 on page 83, 
N(LJ) 
IND) ae a 
N((a8)) 
so by Definition 1.15 on page 16, 
N(LJ) 
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and by Exercise 2.46, 
N(LJ) 
N(Jg) = 
N((a))N((8)) 
Hence, 
NZ) N(J) 
N(9) = = = NDNA). 
N((a)) N((B)) 

2.49 Let N(P) =[]j_1p;"". By Exercise 2.48, P | (N(P)). Therefore, P divides one of the 
principal ideals (p;). If P | (pz) for some k 4 7, then by the Euclidean algorithm, there 
exist u,v € Zsuch that up; +vp, = 1. Since up;, vp, € P, then 1 € P, a contradiction. 
Hence, N(P) = py"! = p™ a prime power. Thus, N(?P) | N(p) = p” = pil, so 
N(P) = p™ for some m < n. 

2.51 All ideals in Z[/10] of norm 6 have the form [6,a + bv/10] where 

a” —10b? =0 (mod 6) 
by Exercise 1.57 on page 53. Thus, [6,2 + /10] and [6,2 — 10] are two of them. By 
Exercise 1.59 on page 54, this is all of them. 
2.53 This is a direct consequence of Exercises 2.48 and 2.52. 
Section 3.1 


3.1 Clearly, sincef (x,y) = g(X,Y) for 


X = px + qy (S8) 
and 

Y=ra+sy, (S9) 
then equivalent forms represent the same integers by definition. Since ps — gr = +1 


and from (S8)-(S$9), « = +(sX — qY) and y = 4(rX — pY), so gcd(z,y) = 1 if and 
only if ged(X,Y) = 1. 


3.3 Suppose that f(x,y) = g(X,Y) where X = pr + qy, Y = ra + sy, and ps — qr = 1. If 


we set « = X and Y = y, namely p= s = 1 and q=r =0, then f(z, y) = g(x,y) and 
we have the reflexive property. Also, since 


g(X1,¥1) ra f(x,y), 


where X; = sx — qy and Y; = py — rz, then we have the symmetry property. 


Lastly, for transitivity, assume that 
g(X,Y) =h(PX + QY,RX + SY), 
where PS — QR=1. Then since 
PX + QY = P(pxr+qy) + Q(ra + sy) = (Pp+ Qr)x+ (Pq + Qs)y = Pix + Qiy 
and 


RX + SY = R(px + qy) + S(ra + sy) = (Rpt+ Srjat+ (Rat Ss)y = Ria + Sry 
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we have 


PS, — Qi Ri = (Pp + Qr)(Rq + Ss) — (Pq + Qs)(Rp + Sr) = 


PRpq+ QRrq+ PpSs + QrSs — PqRp — PqSr — QsRp — QsSr = 
QR(rq — sp) + PS(ps — qr) = -QR+ PS =1, 


so 
f(x,y) = A(Pyx + Qiy, Rix + Sy), 
with P,S; — Q,R, =1, which is the transitive property. 


3.5 If f~g, f = (a,b,c), g = (a1,b1,c1) with f primitive, then 


ax? + bry + cy? = a,(px + qy)? + bi (px + qy)(rx + sy) + cr (ra + sy)? 


(ap? + bipr + cyr?)a? + (2pqay + (ps + rq)bi + 2rsc,)xy + (q?aq + qsdi J 
so if gcd(a1, b1,c1) = g, then g | gcd(a, b,c) = 1, and the result is secured. 
3.7 Applying the substitution x = pX +qY and y=rxX +sY to the form 
f(a, y) = ax? + bay + cy’, 


we get the form AX? + BXY + CY?, where 


A = ap? + bpr + cr’, 


B = 2apq + b(ps + qr) + 2cers, 


C = aq? + bqs + cs”. 


A straightforward calculation shows that 
B? — 4AC = (b? — 4ac)(ps — qr)’, 
which yields the result. 
3.9 If the primitive form f(x,y) properly represents n € Z, then 


f(a,y) = na? + bry + cy” 
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c18)y?, 


may be assumed by Exercise 3.2. Therefore, D = 6? — 4nc. Thus, D is a quadratic 
residue modulo n. If n is even, then D = b?(mod 8) where b is necessarily odd, so 
D = 1(mod 8). Conversely, if D = 6? (mod |n|), where n is odd, we may assume that 
D and b have the same parity by replacing b by b+ n, if necessary. Therefore, since 
D =0,1(mod 4), then D = b? (mod 4|n|), which implies that there exists an integer 
m such that D = b? —4mn. Hence, nx? + bry + my” properly represents n and has 
discriminant D. Lastly, since gcd(D,n) = 1, then ged(n, b,m) = 1, so nx? +bry+my? 
is primitive. If n is even and D = b? (mod 4|n]|), then there exists an integer m such 


that D = b? — 4mn and we proceed as above. 


3.11 Let f(x,y) = ax? +bry+cy? be a reduced form of discriminant D < 0. Thus, b? < a? 


and a <c. Therefore, 


—D = 4ac — b? > 4a? — a? = 307, 
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whence, 
as V(—D)/3. 


For D fixed, |b| < a. This together with the latter inequality imply that there are only 
finitely many choices for a and b. However, since b? — 4ac = D, then there are only 
finitely many choices for c. We have shown that there are only finitely many reduced 
forms of discriminant D. By Theorem 3.1 on page 90, the number of equivalence 
classes of such forms is finite, which is the required result. 


Since a reduced form has coefficients satisfying b? < a? < ac and b? — 4ac = D, then 
D =? — 4ac < —3ac, 
so ac < —D/3. When D = —4n, this means that 
ac < 4n/3. (S10) 


We use (S10) to test for values up to the bound to prove the result. 


When n = 1, this means that ac < 4/3 so a = c = 1 is forced and b = 0. Hence, the 
only reduced form of discriminant —4 is 27+ y?. If n = 2, then ac < 8/3, soc = 2 and 
a = 1 is forced given that ac must be even since b? — 4ac = —8. Therefore, b = 0, and 
the only reduced form of discriminant —8 is x? + 2y?. If nm = 3, then ac < 4. Again, 
since ac must be even, c > a, and gcd(a,b,c) = 1, then c = 3, a = 1, and b = 0 is 
forced. Thus x? + 3y? is the only primitive reduced form of discriminant —12. (There 
is one imprimitive form, namely 2x? + 2xy + 2y?, which we do not count.) If n = 4, 
then ac < 16/3 < 6. With the caveats as above, we must have c= 4,a=1,b=0, 
so a? + 4y? is the only primitive reduced form of discriminant —16. (There is one 
imprimitive form, namely 2x? + 2y?, which we do not count.) 


Lastly, if n = 7, then ac < 28/3 < 9, and (b/2)? + 7 = ac, so the only possibility is 
c=7,a=1,andb=0, so 2? +7y? is the only primitive reduced form of discriminant 
—28. (There is one imprimitive form, namely 2x7 +2xy+4y?, which we do not count.) 
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3.15 


If a ~ —a, then there exist p,q,r,s € Z such that ps — gr = 1 and in the case where 
Ar = 0(mod 4), 
Ar Ar 
x? — Hy" = —(pe + gy)” + =o (rx + sy)” 
By comparing the coefficients of x”, we get 
A 
2 F2 
peepee eee | 
Pp A r , 


so p+r/Ap/4 is a unit of norm —1 in Of = Z[,\/Ap/4]. 
WhenA - = 1(mod 4), then 


eg ne 
4 Fy? = —(pe + qy)? — (pe + ay)(ra + sy) — 


x? + ay 4 F (rx + sy)?. 


By comparing the coefficients of 2? we get that 


(2p +1)? — Apr? = —4, 
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3.19 


so 
pt 


1 + VAP 
2 


is a unit of norm —1 in Or = Z[(1 + VAr)/2]. 


Since we have that 
Tas ~ Tap Par 


= > — - 
Bt Pa, Bx 


then, when F' is real, by Exercise 3.15, CS. = Co, if and only if Or has a unit 
of norm —1. When F is complex, then Pa, = Pe. since all norms are positive, so 
CE. = Cp,. This proves the assertion. 


Using the hint, we see that when b? — 4ac = Ar = 0(mod 4), then b is even so 


A 
acx? + bay + y? = (br/2+ y)? - 7 


since comparing the coefficients of x7, we get b?/4 — Ar/4 = ac, comparing the 
coefficients of ry we get b = b/2- 2, and the coefficients of y? are both 1. When 
Ar = 1(mod 4), then b is odd so 


b+1 ; b+1 ines 
vce? + boy y= (— < vy) +( 5 x y) 04 Z Fe’, 


since comparing the coefficients of x? we get 


5 = = ac, 


b+1\" b+1 1~Ap _ 8 4+2b4+1~2b~ 241-0? + 4ac _ 
2 4 4 


and comparing the coefficients of xy we get 
2:—— -1=6, 


and the coefficients of y? are both 1. 
Seta=1l+uifuA—-1,anda=VAr ifu=-1. IfuF -1, then 


(l+u')u=utuu =u+Np(u)=u+l. 


Therefore, 
a util 
a wtil- ba 
If u = —1, then 
a VAr 
= =-l=u, 
a! —/Ar 


as required. 
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3.23 


This is proved in the same fashion as the solution of Exercise 2.43 presented on page 
378. 
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Let S C R” be the set of points defined by the inequalities 
|Fy (a1, Lae ,In)| <ca+e, 
and 
Fel aige.. yen) |< Ge - SD.) 
where 0 <e€ <1. Then S is convex, bounded, and symmetric. Hence, 


n 


C1 Cn 1 1 
di,+-: —______qdy,, = 2” ———._ > 2"D(L). 
vis)> fd f” ate Nema 


—Ce1 —Cn 
Then Minkowski’s Theorem 3.9 yields the result. 


Let L be the set of all points P € R” that satisfy the first system of equations in the 
exercise for 7 = 1,...,k. Therefore, the implication given in the second display of the 
exercise guarantees that L is an additive subgroup of R”. Also, by that implication, 
if 
Fi(x) = Fj(y) (mod mz), 

then x and y are in the same coset of Z” modulo L. Hence, the number of such 
cosets is at most es m,;. Thus, L is a free abelian group of finite index in Z”. 
Therefore, LZ has rank n, by Exercise 3.24, so L is a lattice. Also, by Exercise 3.23, 
D(L) < ls m,;. Hence, V(S) > 2”D(L). Now, we apply Exercise 3.25 to get the 
result. 


F is clearly a Q-algebra. Also, if a,6 € F, then 
Or(a+ 8) = (Ai(a + B),.-.,Ortro(at B)) = 


(01 (a) + 01(B), -- +, Ory tre (@) + Ory 402 (8)) = 
(01(a),--- Ori tro (@)) + (01(8),---, Orr 4r2(8)) =O F(a) + Or (8), 
and 
Or(aB) = (A1(8),..-,Or4ro(B)) = 
(91 (:)A1(B), -- «Ory tre (@)Or, +72 (8)) = 


(01 (a), ee Ory tre (a))(A1(8), as) Ory tr (B)) =0 F(a)Opr(£), 


soO pf is a ring homomorphism. If q € Q, then since 0;(¢q) = gq for all j, then 
Or(qa) = qOr(a) for allg € Qanda€ F’. Hence,O fF is a Q-algebra homomorphism. 
Finally, since @; is a monomorphism for each j, then it follows thatO p is a Q-algebra 
monomorphism. 


By Exercise 3.18, J" = (8) for some 8 € Op. Let a = B'/"F EA. Since Opa) = 
An F(a), then a € Dp). Also, 


hr 


(Or(al)"” = Opal"? = Op(a)(8) = OF(a)(a)"*? = (OF(a)(@)) 


By Theorem 1.17, 
DO r(ayl _ D F(a) (Q). 
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3.33 By Exercise 1.45 on page 34, there are 6, € Op, such that J = (6, ). Substituting 
this into the given ideal equation we get, 


Therefore, 
y=dd+vp 
for some \,v € Ox. By Exercise 3.31, 6,4 € Of (q)(a). Thus, there are 7,¢ € Opa) 
such that 
6 = na, and w= Ca. 
Hence, 


y= Ana + va = a(An + v6), 


so a | yin A. A similar argument shows that + | a. Hence, a and ¥ are associates in 
A, so there exists a unit u € A such that y = wa, as required. 


Section 3.4 


3.35 For j ranging over the values 1 < j < p* with gced(p, 7) = 1, we get 
: pe _ a Ba) aaa 
eal en es kp 
IIe Cpe) ~~ ptt —] — sa : 
j k=0 
Then set x = 1. 


3.37 By Exercise 3.35, 


Also, by Definition 1.32 on page 40 and Exercise 3.35,® ,,(1) = p. 


Np (2) - ee - 


3.39 Let n = gee ps’ be the canonical prime factorization n. For the balance of the 


solution, all sums and products range over 7 € N such that j < Pp; and p{j. A 
simple induction shows that 


2a;—1 


g=\j=4 2 


since n > 2. Therefore, by Theorem 2.5 on page 66, 


= 1) 


=0 (mod P;"), 


(0) /O(C 45) _ \ O()/ OC 45) So(n)/$(C 25) 
Nelo) = (Negen (Ge) : ={ 2) : fo=l 


Thus, 
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3.41 From the additive property of the logarithm, £r(a3) = £r(a)+£r(8). Hence, Lr 
is a group homomorphism. 
3.43 If T and U are both even, then <q, € Z[/Apr], so since 
Z|V Ar] C Or =Z[(14+ VApr)/2), 
then G = (€q,,) if and only if T and U are even. 
3.45 Let I € Co,.2. Then J? ~ 1 and I ~ I’ by the preamble to this exercise. Thus, by 
Definition 3.7 on page 100, there is an a € F' such that J = al’. Taking norms we 
have N(I) = N(J’), so Nre(a) = +1. WhenA pf < 0, then Nr(a) = 1. WhenA fF > 0 
and Nr(a) = —1, we may multiply a by er to get Nr(era) = 1. Hence, without loss 
of generality, we may assume that Nr(a) = 1. Therefore, by Exercise 3.21, there is a 
8 € Op such that a = §’/8. Therefore, J = BI is an ambiguous ideal in I since 
J'’=('l' = Bal = Bl= J. 
3.47 Suppose that I = (N(J), (b-— VApr)/2). If 
[=I' =(N(I),(6— VApr)/2), 
then 
(b+ VAr)/2—(b— /Ar)/2= V/Are I. 
Thus, I | (Ar), so by Exercise 2.46 on page 86, N(I) | Ar. 
Conversely, if N(I) | Ar, then by Exercise 1.57 on page 53, N(J) | b sinceA Ff is 
not divisible by the square of any odd prime. Therefore, —b = 0(mod N(JI)), so by 
Exercise 1.60 on page 54, J = I’. 
3.49 Suppose that (3) = (6’), where 8 may be assumed to be primitive. Then there is 


a unit u € Lo, such that u = 6/8’. We may assume, without loss of generality, 
that there are no nontrivial rational integer factors in 6. Thus, u = +e, for some 
nonnegative n € Z. Ifu=e'k,, then set p = 8/a". Therefore, 


p' = B'/(a’)” = B'B/((a")"B) = B'B/(a"B’) = B/a” = p, 

where the third equality follows from the fact that 
EA, = (a/a’)” = 8/8" implies (a’)"8 = ap". 

Hence, p = z € Z, so 8 = az, but there are no nontrivial rational integer factors 
in 6, so |z| = 1. Hence, 6 = +a”. However, by Exercise 3.47, Nr(3) = Nr(a”) 
dividesA p-. If n > 1, then |Nr(a)|” = 4 is the only possibility, namely n = 2 and 
Nr(a) = £2, since the only possible square dividingA pr is 4. Thus, a = «+ y/Dr 
for some x,y € Z with x? — y?D = +2. Therefore, 

=+(2? +y?Dp t+ 2ryV/ Dr) = +(+2 + 2y?Dr + 2ry\V/ Dr) 

= £2(+1+ y?Dp + cy Dp), 

so 2 is a nontrivial rational integer factor of 8, a contradiction. Hence, n = 0, or 
n=1. Ifn=1, then (8) = (a), and if n = 0, then (8) = (1) = Or. 
If 6 = —e 6’, then set p = B/(a"/Dr). Again we get that p = p’ as above. Hence, 
ta”"/Dr = 6, and Nr(p) | Ar, so again n = 0,1. Thus, either (6) = (/Dr) or 
(8) = (aVDr). 
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3.51 


3.53 


By Exercise 1.57 on page 53, we may set 
T= (N(D), (6+ VAr)/2), 
so I’ = (N(I),(—b + VAr)/2). Then by the multiplication formulas on page 48, 
I’ =(N(J)), 
with a] = a2 = N(I) = 4g, and a3 = 1. 
By Exercise 3.52, Nr(€a,) = —1, where ea, = (r+5,/p)/2 for rational integers r = s 


(mod 2). By taking 
3 
a r+ 8,/p 
t+ Yy/p = (=) 


we get that x2? — py? equals 


Nee + yp) = Ne (4>4)) : (%: (rte) Renee 


where x, y € Z is verified by a simple check. 


Assume that I is reduced and let I = [a,a], where a = N(I) and a = (b+ /Ar)/2. 
Set 
Bo = |-a’/ajat+ael. 


Then, since |—a’/a| > —a’/a—1, 
83 = la! /aja—a" <a. 


If 85 < 0, then |8o| = —Go > a, by the definition of reduction. Therefore, 


|-a’ /ala — a = —Bp = —|-a'/ala—a>a> || =—|-a'/ala—a’. 


Hence, 

(b- JAp)/2 =a! > a= (6+ VBr)/2, 
namely —/Ap > VWAp, a contradiction. Hence, 69 > 0. Therefore, there exists a 
least element 6 € I such that |6’| < a and 8 > 0 (possibly 8 = 89). Since I is reduced, 
then 8 >a. Also, since 0 < 6—a < 8, then |6’ — a| > a, by the minimality of 3. If 
B' > 0, then a — 3’ = |6’ — a| > a, so B’ < 0, a contradiction. Hence, 


—a=—-N(I) <p’ <0. 
Since 6 € I, we may let, 
8B =am-+agn for some m,n € Z. 
If |n| > 1, then let m = s + nt for t € Z and |s| < |n|/2. Set 
y =|6—sa|/n=|a+at| el. 


Therefore, 
l7'] = |(6" — as)/n| < |B"/n| + |as/n| < a/2 + a/2 =a. 


However, 


y= 11 < [6/n| + las/n| < 8 2+6/2=8, 
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contradicting the minimality of G. Hence, |n| = +1. Therefore, by Exercise 1.60 on 
page 54, I = a, 


Conversely, suppose that I = [a,a] such that a > a, and —a < a’ < 0. If I is not 
reduced, there is a y € I such that |y| < a and |y'| < a. Since y = ma+ na for some 
m,n € Z, then 


|ma+na| <a, (S11) 


and 
|ma +na'| <a. (S12) 


If mn > 0, then (S11) is contradicted. If mn < 0, then (S12) is contradicted. Thus, 
mn = 0. If m = 0, and n 4 0, then (S11) implies that |a| < a, a contradiction. If 
n = 0, and m 4 0, then (S12) yields a contradiction. Therefore, m = n = 0, and I is 
reduced. 


First, assume that F is real. If J is reduced, then by Exercise 3.55, there is a G € I 
such that 8 > N(I), —N(1) < 6’ < 0, and J = (N(J), G). Therefore, 


N(I) < B- 6 =0,, —wh, = VAF. 
If F is complex, and IJ = (N(JI),a), then 
4Np(a) — Tr(a)? = —Apr. 
If I is reduced, then |a| > N(J), and since |Tr(a)| < N(J), then 
—Ar = 4Np(a) — Tr(a)? > 4Nr(a) — N(I)? = 
Ala|? — N(I)? > 4N(I)? — N(I)? = 3N(I)?. 


If I is not reduced, so that 4| Ap and VAp/2 € I, then set 8 = /Ap/2. Otherwise, 
set 6 = /Ap. By Exercises 3.56 and 3.58, N(I) > 8, or I is already reduced. Since 
G € I, there exists an O-ideal J such that IJ = (8), by Corollary 1.7 on page 27. 
Since 6’ = —6, then |Nr(G)| = 8? and N(J) < B. Since I = I’, then J = J’. Hence, 
J is reduced. Since IJ ~ 1, then I? ~ I. However, I? ~ 1 by Exercise 3.47 and 
Exercise 3.20 on page 107. Therefore, I ~ J. 


Let I= (N(J),6+wa,,), and set J = (n,b+wa,,). Since 
N(1) | Nr(b+w,,), and n| N(J), 


then J is an Op-ideal. If n is even, and andA - = 0(mod 4), then (2) | I, contradict- 
ing the primitivity of J. Therefore, since 


ANr(b+wy,) z= (2b + wa, + wh)? — Ar, 


then ged(n, 2b+wy,, +w),,) = 1. Thus, by the multiplication formulas given on page 
48, 
pe = 
J° = (n*,b+wa,) = 1. 
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4.1 We have that (—3/p) = (—1/p)(3/p) = 1 if and only if (—1/p) = (3/p) = —1 or 
(—1/p) = (3/p) = 1. Thus, from the hint, (—3/p) = 1 if and only if either p = —1 
(mod 4) and p = +5(mod 12), or else p = 1(mod 4) and p = +1(mod 12). In other 
words, (—3/p) = 1 if and only if either p = —5(mod 12) or p = 1(mod 12), and this 
holds if and only if p = 1(mod 3). 


4.3 Since (—11/p) = (—1/p)(11/p) = 1 if and only if (—1/p) = (11/p) = —1 or (—1/p) = 
(11/p) = 1, then (—11/p) = 1 if and only if either p = —1(mod 4) and p = 1,3,4,5,9 
(mod 11), or else p = 1(mod 4) and p = 1,3,4,5,9(mod 11). In other words, 
(—11/p) = 1 if and only if either p = 3,15, 23, 27,31 (mod 44) or p = 1,5,9,21, 25 
(mod 44), and this holds if and only if 


p = 1,3,5,9, 15, 21, 23, 25,27,31 (mod 44). 


By Corollaries 1.1-1.2 on page 13, Theorem 1.28 on page 45, and Theorem 3.6 on 
page 103, we have that h_11 = hgj4.y—a1)/2] = 1. Thus, by Theorem 4.1, if (Ar/p) = 
(—11/p) = 1, then p = a?+ab+30? for some integers a, b. Also 11 = (—1)?—1-243-2?. 
Conversely, by Exercise 3.9, if p #4 11 and p= a? + ab + 3b”, then (—11/p) = 1. 


4.5 Given that (—43/p) = (—1/p)(43/p) = 1 if and only if (—1/p) = (43/p) = —1 or 
(—1/p) = (43/p) = 1, then (—43/p) = 1 if and only if either p = —1(mod 4) and 
p = 1,4,6,9, 10, 11, 13, 14, 15, 16, 17, 21, 23, 


24, 25, 31, 35, 36, 38, 40,41 (mod 43), (S13) 


or else p = 1(mod 4) and (S13) holds. This implies that (—43/p) = 1 if and only if 
either 
p = 11,15, 23, 31, 35, 47, 59, 67, 79, 83, 87, 95, 99, 


103, 107, 111, 127, 135, 139, 143, 167 (mod 172), (S14) 


or 
p = 1,9,13, 17, 21, 25, 41, 49, 53, 57, 81,97, 101, 109, 


117, 121, 133, 145, 153,165,169 (mod 172), (S15) 
Lastly, (S14)-(S15) hold if and only if 
p =1,9,11,13, 15,17, 21, 23, 25, 31, 35, 41, 47, 49, 53, 57, 59, 67, 79, 81, 


83, 87, 95, 97, 99, 101, 103, 107, 109, 111, 117, 121, 127, 133, 
135, 139, 143, 145, 153, 165, 167,169 (mod 172). 


Now, as in the solution of Exercise 4.3, we have that h_a3 = hgj(14.y=a3)/2] = 1. Thus, 
by Theorem 4.1, if (Ar/p) = (—43/p) = 1, then p = a? + ab + 110? for some integers 
a,b. Also 43 = (—1)?-—1-2+11-2?. Conversely, by Exercise 3.9, if p 4 43 and 
p =a’? +ab+11b?, then (—43/p) = 1. 
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4.7 The following are all the prime values for class number one negative discriminants via 


Rabinowitsch. 
—Ap | a? +24+(1—-Ap)/4 for « = 0,1,..., [[Ar]/4—1] 
3 etatl _ 
7 e+at2 2: 
11 gf? +e+3 3, 5. 
19 et+ats 5, 7, 11, 17. 
43 x +aet+11 11, 13, 17, 23, 31, 41, 53, 67, 83, 101. 
67 x? +a+17 17, 19, 23, 29, 37, 47, 59, 73, 89, 107, 
127, 149, 173, 199, 227, 257. 
163 x? +a4+41 41, 43, 47, 53, 61, 71, 83, 97, 113, 131, 
151, 173, 197, 223, 251, 281, 313, 347, 
383, 421, 461, 503, 547, 593, 641, 691, 
743, 797, 853, 911, 971, 1033, 1097, 1163, 
1231, 1301, 1373, 1447, 1523, 1601. 


4.9 First we note that, using the notation in the proof of Theorem 3.5, 
r: J = (2,1++/-5) 4 (2,2, 3), 


and 
7:I=(1,V—5) 4 (1,0,5) 
where J % 1 in Co,. The latter holds since (1,0,5) and (2,2,3) are reduced forms 


so if they were properly equivalent, then they would be identical by Claim 3.1 on 
page 90. Also, we note that J? = (2). 


For part (a), If p = 1,9(mod 20), then (—5/p) = 1 so by Theorem 1.30 on page 49 
and Remark 1.24 on page 52 (p) = PP’, where P = (p,(b + V—20)/2) and P! = 
(p, (—b + /—20)/2). Now, if P is principal, then P = (a + b/—5) for some integers 
a,b. Thus, 

(p) = PP’ = (a + b/—5)(a — b/—5) = (a? + 5d”), 


so since N(P) = p, then p = a? + 5b?, as required. If P is not principal, then P ~ J, 
so PJ ~ J? ~ 1. Hence, there are integers x,y so that PJ = (x + yv—5), so 


N(PJ) = N(P)N(J) = 2p = a? 4+ By”. 


Thus, both x and y are odd, so 2p = 6(mod 8), whence, p = 3(mod 4), a contradic- 
tion. We have established one direction for part (a). Conversely, if p = a? +5b?, then 
(p/5) = (a?/5) = 1, sop = 1,4(mod 5). Also, since one of a,b must be even, then 
p = 1(mod 4). Hence, p = 1,9(mod 20), as required. 

For part (b), first assume that p = 3,7(mod 20). Then (—5/p) = 1 since (—1/p) = 
—1=(5/p). As above (p) = PP’. If P is principal, then as above p = a? + 5b”, which 
means that p = 1(mod 4), a contradiction. Thus, as above 2p = x? + 5y? for some 
integers x,y. Thus, « and y must have the same parity, so we may select an integer 
z such that « = y+ 2z. Therefore, 


Qp = (y + 2z)? + Sy? = 42? + Ayz + By’, 
and dividing through by 2, we get 


p = 227 + Qyz + 327. 
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We have established one direction for part (b). Conversely, assume that there are 
integers a,b with p = 2a? + 2ab + 3b?. Then 


Qp = x? + By”, 


where x = 2a+b and y = b. Thus, as above, p = 3(mod 4). Also, (—5/p) = 1 by 
Exercise 3.9. Therefore, 1 = (—5/p) = (—1/p)(5/p) = —(5/p), so (5/p) = (5/p) = -1. 
Thus, p = 2,3(mod 5), whence p = 3,7(mod 20), which secures part (b) . 


Section 4.2 


4.11 Since I’2r ~ 1, I” ~ 1, and gced(hg,,n) = 1, then there exist integers x,y such that 
nz +ho,y =1. Therefore, 


I = [rtths py = (I")* (IY)? AS 1, 
as we sought to prove. 


4.13 In Theorem 4.2, let k = —13 = —1—3u? with u = 2, for which x = p™ = 4u?+1=17 
with m = 1 and y = +2(3+ 8-2?) = +70. Thus, p = 2? + 13, and 70? = 17° — 13. 
Thus, (x, y) = (17, +70). 


4.15 As per the hint, a solution (x,y) to (4.2) implies that 


y+Vk = w(utovky3 (S16) 


for a unit w € Or and some u,v € Z. Then w = +¢7 for some z € Z. Since we 
may write z = 3z1 +r where r € {0,+1,+2}, then we may absorb (-te;")? into the 
cube (u + uVk)°, so we may assume, without loss of generality, that w = ef, where 
r € {0,+1,+2}. Given the definition of ¢ and the fact that (T+UVk)~! = T-UVk, 
then we may assume w € {4 : 7 =0,1,—1} if e, has norm 1 and 


w € {el : 7 =0,2,—2} 
if ¢, has norm —1. In either case, w € {e7 : 7 = 0,1,—-1}. 
Case 8.7 w=1 


From (S16), 
y + Vk = (u? + 3uv?k) + (3u20 + ok) Vk, 


so by comparing coefficients of Vk, we have that 


1 = 3u?v + u°k = v(3u? + v7), (S17) 


so v = +1. Hence, multiplying (S17) by v yields 


+1 =v =3u7v? +0°k > k > 1, 


a contradiction. 


Case 8.8 we {T+ UVk} 
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From (S16) we have 


y+ Vk =(TLUVk)(u t+ ovk)? = (T £UVk) G + 3uv2k) + (3u20 + vk) vb) 


= (T(u? + 3uv?k) £ (Uk(3u? + vk) + (T(3u20 + v?k) £ U(u3 + 8uv7k)) Wk. 
Therefore, by comparing coefficients of Vk again yields 
1 = T(3u?v + v°k) + U(u? + 38uv?k). (S18) 


Since k = 4(mod 9) and U = 0(mod 9), then 1 = T? — kU? implies that 
T=+1 (mod 81). 


Hence, by (S18), 
1 = a(3u? + 4v?)v (mod 9), (S19) 


where a = +1(mod 9). 
From (S19), av = +1(mod 9), so 


3u°+4=av=+1 (mod 9). 


Thus, 
3u2 = 4,6 (mod 9), 


which are impossible. This completes all cases. 


Section 4.3 


4.17 According to the hint, if 93° ,(1/7) = d ¢ R. Then there is an N € N such that 
N<d<WN +1. Also, note that 


aren Src Vets seule a ica ee ga 
Y ae ae ue cae le a es: 


j=1 


so each block has a sum bigger than 1/2. Let M € N be chosen such that the number 
of blocks larger than 1/2 satisfies M > 2N. Then 


a contradiction. 


4.19 By Exercise 3.37 on page 129, 1—¢, and 1=G are associates for all 7 = 1,2,...,p—1. 
By Exercise 2.24 on page 68, we have the ideal equation 


p-l 


[[G-@) =©). 


j=l 
However, given the comment on associates, 


(1-G) =(1- G) =(), 


sO 


as required. 
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4.21 


4.23 


4.25 


4.27 


4.29 


4.31 


If k = 1(mod p), then congruence (4.23) on page 153 becomes 
x(¢,' — Cp) =0 (mod (p)). 


In the same fashion as in the elimination of the case k = 0(mod p), we get that p 
contradicting the hypothesis. 


First, we show that 
x x 


We) =a ea 


is an even function, namely that f(#) = f(—a). We have 


e*+1 1+ e* ev +1 
2f(—«x) = (SH) - © (TES) = 0 (SE) =r. 


Therefore, by Definition 4.1, 


ea | 2 yn | ny ..\n 
ea 2 a ap re 
so 
Br n n 
Seer —(-«)"]=0 
n=2 


which implies that each coefficient Bz»,41 = 0, as required. 


This is immediate from Definition 2.8 on page 83 and Definition A.11 on page 327, 
since the different cosets of J in Or form the different residue classes modulo I. 


This will follow from Exercise 2.47 on page 86 once we establish the result for any 
prime power. The integers in Or that are not relatively prime to P* are those divisible 
by ?. There are N(P2~') = (N(P))*~! of these that are incongruent modulo P*. 
Thus, 
1 
&(P*) = N(P)* — N(P)*"* = N(P*)(1-—— }. 
9") = NO)" — NOY = NE) (1- ze) 

Since f is the product of d linear factors in its algebraic closure, then it has exactly 
d roots there. 


This follows from Exercises 4.28 and 4.30. Since the residue classes modulo J, rela- 
tively prime to J, form a group of order ®(J), then 


a? =1 (mod J), 
for any a € Df relatively prime to J. In particular, if J = ?, a prime Op-ideal, then 


®(P) = N(P) —1, so 
aN()-1 =1 (mod P). 
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Section 4.4 


4.33 


4.35 


4.37 


4.39 


4.41 


4.43 


Let F' = Q(a) where a = W—2. Since |Apr| is minimal over all discriminants of bases 
for F over Q, then by Theorem 2.7 on page 71, 


disc{1,a,a?} = D?Ap, 
where D = |Or : Zia]|. Also, we compute 
disc{1,a, 07} = —108 = —2?. 3°. 
Since 
\Or: Z| =|Or : Zlal]-|Zla] : Z| = 3, 
then D must be odd. If D > 1 then 3 | D. Since Z{a] = Z[a + 2] = Z[a], where 


a = a+2, we choose to work with the latter at this stage. Since 3 | D, there must 
exist a 6 € Z[a] such that 

= bo + bya + boa? 

Sg 

where 3 does not divide all of the integers 6; for 7 = 0,1,2. Suppose that 3 | bo but 
34b,. Then B—b9/3 = (b1a+b2a7)/3 € Or. Also, y = b1a?/3 = (8—bo/3)a—a2b2/3 
Or since a/3 is an algebraic integer given that it is a root of 


6 


(32 — 2)? + 2 = 27x — 54a? + 36x — 6. 


Therefore, 
3°Nr(7) = Nr(37) = Np(b1a7) = B38 Np(a)? = —403, 


so 3 | b,, a contradiction. The other cases such as 3 | b; but 3 { bo are handled 
similarly. Thus, D = 1, disc{1,a,07} = Ap, and Z[a] = Op. 


Since, for a primitive cube root of unity ¢3, we have 
Nr(8) = (a+ ba + ca’) (a + b63a + c62a?)(a + bea + clfa?), 
then using the fact that a le = 0 we get 
Nr(8) = (a + ba + ca*)((a? + 2bc) — (ab + 2c*)a + (b? — ac)a’), 


so, by simplifying, 
Nr(8) = a3 — 2b? + 4c? + 6abe. 


Since (57 — 1) | (577 — 1) and 4 | (57 — 1), then (57 — 1)/4 = 19531 | (577 — 1). 


Since 3(3789 — 1) = 3249 — 3 = x3 — 3, where x = 3°°, and Nr(a + 6/3) = a® + 303, 
for F = Q(W3), then Ne(a — ¥3) = x? — 3. An initial run shows that 


gcd (374 — 3, a? + 36°) = 479, 
for a = 14, and b = 185, so 479|(37°9 — 1). 
n = 12358397 = 3361 - 3677. 
n = 74299271 = 7789 - 9539. 
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4.45 Here r = 2, s = —3, and t = 153, for n = 2343. Thus, from (4.52), k = 77, 
m = 277, and c= —6. We select d as in (4.51), to get d= 2. Thus, 


f(x) = 2? +6 with a = V6, 


and F = Q(V6) having ring of integers Z[/6] a UFD. A smoothness bound need not 
be chosen large since an initial run produces 


gcd(a + b- 277, n) =5 for a = 3,b = 1, 


and 
ged(a + b-277,n) = 11 for a= 15,6 =1, 


so 5 and 11 are factors. In fact 
2153 43 = 


5-11 - 600696432006490087537 - 345598297796034189382757. 


Section 5.1 


5.1 That JCIO,KN F is clear. We now establish the reverse inclusion. By Remark 1.13 
on page 26, there exists a 6 € A such that 6J C Op. Thus, by Exercise 3.31, there 


exists an a € A such that 
aDK = BID K. 


Also, by Exercise 3.32, 
BIOKNF=a0KnNF COr(a\NF = BI, 


so DIO KOE C BI, from which we get JOn NF CJ. Hence, IOx NF = J as required. 
The last statement in the exercise follows from the above result since 


J=IOxnNF=JOKNF =J. 


5.3 Suppose that J=[]/_, Pj’ and J = []j_, Q’ ?, where the P; and Q, are distinct prime 
Ox-ideals, with a;,b; € Z, and set frpp(P ne Fis fee i he Suppose further 
that P; = Q; for 7 = 1,2,...,m <n, where “ossibly m = 0, rianine that J and J do 


not agree on any of ne pre factors. Then if pj; = P; N Op and qj; = 9; 1D -p, then 


NK/F (9) NK/F (g )= 1 a3 f5 I q; hg Te (aj+b3)f5 I pa byhj = N¥/F (99), 


jg=mt1 


5.5 By Exercise 1.38, there exists an ideal H such that HJ = (a) for some a € R. There- 
fore, JH Z al, for if a € al, then a = ao for some a € I, so by the cancellation 
law, 0 = 1 € I, contradicting that J A R. If we choose and fix a 6 € A such that 
BJ Z al, we may set y = B/a € F. This forces yJ = (G/a)J ZI. Also, since 8 € H 
and HJ = (a), then for any 6 € J, 6 = ar for some r € R. Hence, 7J C R. 
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5.7 It suffices to prove this for a prime O-ideal J = p by Exercise 5.3. Suppose that 
2 " 
pOx = [] 9)’, where ej = exyr(P;) and g = 9xr(h)- 
j=l 
Then ; : 
NK/F (p) = [] NE/P(2,)% = [] pools = polar eof = p, 


j=l j=l 


by Theorem 5.3 where fj = fx/r(P;)- 


5.9 Set f(x) = eae a;x? with aq #0. Then, if ag = 0, f(p) = 0(mod p) for all primes p, 
so we assume that ag 4 0. If pi, p2,... pz are all of the primes that divide f(a), and 


if c= ao iRise pi, then f(cy) = aog(y) where 


d k j 
gy) = doa; (I> ay 'y’ =1 (mod pj) 
j=0 i=1 


for all i = 1,2,...,k. Thus, p; { g(y) for all such i. Therefore, |g(y)| = 1 for all 
nonzero y € Z. By Lagrange’s Theorem A.7, the congruences 


g(y) =1 (mod p) and g(y) = —1 (mod p) 


each have at most d solutions each for any prime p, so there must exist a y € Z such 
that a plg(y) for some prime p ¥ p; for any 1 <i < k, a contradiction. 


5.11 We use Theorem 1.30 to conclude the following facts. If p = 1(mod 4), then p splits 
in Q(t) C Q(¢an). If p = 3(mod 8), then p splits in Q(V—2) C Q(don), and if p= 7 
(mod 8), then p is split in Q(V2) C Q(Gan). Since p = 2 ramifies, we are done. 


Section 5.2 


5.13 If Ox = Op{al, then OF = OxK/m’, p(a), by Theorem 5.8, so 
K a,F 


(9%)? = OF mM, p(a) = Ox, p(a). 


Conversely, if OF, = Ox /mi, -(a), we need only show that any 6 € Dx is in Opal. 
However, by the Lagrange Interpolation Formula, there is a polynomial f(x) € F [a] 
such that f(a) = 6. Hence, 8 € Fla]. Therefore, 6 € Fla] N Ox = Op|al. 


5.15 We use induction on n. If n = 1, then the result is clear. The induction hypothesis is 
that the result holds for n — 1. Since 


d(a") = ad(a"") + a"~1d(a), 


then 
d(a”—!) = (n— 1)a”"~70(a), 


by the induction hypothesis, so 


Solutions to Odd-Numbered Exercises 397 


5.17 By employing Theorem 5.7, it is straightforward to verify that I* = (4, a0, Thus, 
I*~! = (20,4,/10). By part 1 of Lemma 5.4, 


I*! = (20,4V10) = 2V10I = Drjg(I) = ID ro, 


so Dryg = (2V10). Thus,A pg = (40) = (Ar). Note that Np(2V10) = —40, but as 
ideals, (—40) = (40), whereA » = 40 is also given by Application 2.1 on page 77. 


Section 5.3 


5.19 (a) Since N*/¥ (I), N¥/ (J) are Op-ideals, then the result follows from Corollary 
1.7. 


(b) If the desired norms are not relatively prime, there is a prime D-ideal p dividing 
both. Thus, pO x | N*/F (J) Ox, so pO x is relatively prime to J. Therefore, no prime 
above p occurs in the factorization of J. In particular, p does not divide N*/" (I), a 
contradiction. 


5.21 This proceeds much the same as in the proof of Theorem 5.12 on page 214. Let 
a € Ox,. Therefore, mo,K, (x) | Ma,F(£), so there exists a monic f(x) € Ox,[z] such 
that 

Me,F(L) = Mo, K(x) f(x). 
Thus, 
ma p(@) = Ma K(O) F(a), 


som, p(x) € DrjK,- Therefore, by Theorem 5.9, Dx,/r C Dr/x,, namely 
Dyk, | Dx,/FOL- 


5.23 Suppose that pO; = al 1 for distinct prime O;-ideals P;. Since we have that 


p{ N/K (j,), then P; { fa for all such 7. By the Chinese Remainder Theorem for 
ideals, there exists a 6 € Ox[a] such that 


6B =0 (mod f,) and 6=1 (mod P;) for all 7 = 1,2,...9. 


We may let 6 = g(a) for some polynomial g(x) € Ox|x]. Then, for any y € Or, we 
may write y = h(a)/g(a) for some h(x) € Ox[x]. Since gcd(6Oz, p07) = 1, there 
exists an n € N such that 

g(a)” =1 (mod pO_z). 


Therefore, 
y= g(a)"'h(a) (mod pz). 


By setting k(a@) = g(a)"~h(a), we get the result. 


5.25 Clearly we have 
la + B| < max{al, |8]} = [5]. 


Also, since 6 = (a+ 8) —a, then 
|8| < max{|a + 6],| — al}, 
and since |a| = | — al, the latter implies that || < |a + 8], so 


la+ 6] = |6|. 
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5.27 (a) Since y(1) £0 and y(1) = x(1- 1) = x(1)x(1), then x(1) = 1. 
(b) Since at = a for all a € Fy, then a7~! = 1 for all a € F%. Thus, 


1 = x(1) = x(t") = x(a). 


(c) Since 
1 = x(1) = x(a~*a) = x(a) x(a), 
then x(a~') = x(a)~. By part (b), |x(@)| = 1, so x(a) = x(a)7?. 
5.29 We have 
xXA(ab) = x(ab)A(ab) = x(a)x(B)A(a)A(D) = x(a) A(a)x(B)A(B) = XACA)XA(B), 


so yA is a character. Also, 


x7" (ab) = (x(ab))~* = (x(a)x(8))7* = x" (a)x7"(0), 


so y~! is a character. That €h (Fj) is a group now follows from Proposition A.1 on 
page 321. 


5.31 Let S= De xeen ex) x(a). Since a £ 1, then by Exercise 5.30, there exists a \ € Ch(Ff) 
such that A(a) 4 1. Thus, 
Ma)S= DP Aaxla= YY Arla =S, 
xECH(FF) xeCh(Fy ) 
so (A(a) — 1)S = 0, which implies that S = 0 since A(a) — 1 4 0. 
5.33 If p|k, then ¢J* = 1 for all j, so 


cy -$5(2)-0-(S)en 


—see [53, Exercise 4.5, p. 187], as well as Exercises 5.27 and 5.31. 
If p{k, then 
k Pe Re 
(Z\em=> (F)at=¥ (2) gu 
P ga ©? eo \P 


where the penultimate equality comes from the fact that @ goes over all residues 
modulo p as jk does. Hence, 


5.35 This is immediate from Exercises 5.33-5.34. 
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5.37 


5.39 


If p is unramified in K; for 7 = 1,2 and P is any prime Ox-ideal over p, we need 
only show that ex, %,/r(P) = 1. By Exercise 5.8 on page 195, there exists a normal 
extension field Z of F' containing K,K2. Let Q be a prime O,-ideal above P. Then 
To(L/F) is the is the inertia group of Q with inertia field To(L/F). By part (c) of 
Corollary 5.17 on page 227, K; C To(L/F) for 7 = 1,2. Since K)K2 is the smallest 
field containing both Ky and Ko, then KiK2 C To(L/F), so ON Ki Ky = P is 
unramified over F’. 


To prove the last assertion, use the above argument with Do(L/F’) taking the role of 
To(L/F). 


Since Nr/g(A) = p, by Exercise 2.24 on page 68, there exists an a € Z such that 
7 =a (mod A) 
by Exercise 4.32 on page 164. Hence, 
>? =a? (mod X?), 
so by taking z = a?, we are done. Since p = \?~!u for some u € Up, then 
y? =z (mod p) 


as well. 


Section 5.6 


5.41 


5.43 


5.45 


First we show that there exists exactly one prime 9 r-ideal p above p, which is totally 
ramified in F’. Let p, and p2 be two prime O--ideals above p. From Lemma 5.7 on 
page 221, we know that 


Dy, (F/Q) = Dp, (F/Q) = Dp(F/Q), 


—see also Remark 5.5 on page 222. By Theorem 5.4 on page 189, p is unramified in 
T,(F/Q) = T, so p{ Ar by Corollary 5.8 on page 210. By Theorem 3.15 on page 126, 
Ar | Ar, which is a power of p. Hence, g{ Ar for any gq 4 p. Therefore,A 7 = 1 and 
T = Q by Corollary 5.9 on page 213. Also, since Z = Z,(F/Q) C T, then Z = Q, so 
p is fully ramified in F’. 

Since J,(F'/Q)/Vi is cyclic by part (e) of lemma 5.15 on page 247 and since V; = 1, 
by the above, then J,(£/Q) = Gal(F/Q) is cyclic. 


Let +t € Dp(K/F) and o € V;. Then for a € Ox, we have 
art’? = ((a7)’)" = (a7)"* =a (mod ?'t"), 


by the definition of the V;. Hence, tar~! 


Dp(K/F). 


€ Vj, which is therefore normal in 


It suffices to prove the result for A/F totally ramified at P, since the V; for P in 
K/F is the same as the V; for P in K/T where T = Tp(K/F'). Thus, we assume that 
F=T, so |K : F| = ex;r(?) =e. 
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Let a € P— f?, then from Claim 5.7 on page 204, it follows that 
{ia orxicae *} 


is an integral basis for Kp/F,, where Kp = Ox/P and Fy = Or/p with p= PN OP. 
Since 
|Kp: Fy|=e=|K: Fl, 


then Dx = Opa]. Hence, from Theorem 5.8 on page 200 and Lemma 5.6 on page 202, 
we have that 

Ox/r(Q)OK = Deir. 
Thus, P* | Ox/r(a), but P&*t { dK) p(Q). 


Let o € Vj—Vj41. Thus, a—a? € P/t!—PI*?, by the definition of the V;. Therefore, 
s is the exact power of P dividing (a — a7)OK. Since 


Ma,F(£) = II @ ys a’), 


o€Gal(K/F) 
then 
1,F(@) = |] («- 0”), 
oFAl 
so 
5x/r(a) = |] (a- 0”) 
oFAl 
If we let 
OK(a- a’) = On Pr), 
then 
m-1 m-1 
s= 5° s(a) = S> s(o) = SO (Vil - Vj NG +) = 
ofl j=0 gey, j=0 
oEV 541 
(Vol — |Vi]) + 2(1V1| — [Val) + 8(1V2] — |V3]) +--+ + m(|Vm-—1| — 1) = 
m-1 
[Vol + [Vil + [V2] +++ + [Vii] —m= 2 ((Vj] = 1), 
j=0 


which is Hilbert’s formula. 


By part 4 of Lemma 5.4 on page 197, with J = Ox, we get that Tx/;p(Ox) is the 
least common multiple of the O,-ideals dividing Dx p. In other words, the biggest 
D p-ideal dividing Dx/F is Tk/F(OK)OK.- So, if Tk/F(OK) = Op and p | Dir for 
some prime O-ideal p, then Dx p C p. However, from part 4 of the aforementioned 
lemma again, 

Or =Tk/r(Ox) C DKF, 


a contradiction to the primality of p. Conversely, if Tx,/p is not onto, then given that 
it is the lem of the Op-ideals dividing Dx p, there exists a prime 0 p-ideal dividing 
Dr /F- 
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5.49 By Exercise 5.47, p{ Dx pr for any prime Op-ideal. By part 3 of Lemma 5.4, 
Ax/rOK = N*!/¥ (Dxjp)Ox = Diy pOK. 


Thus, if p” | Ax;r, then p | Dr, a contradiction. Therefore, if 0% = Opr[G] as an 
Op[G|-module, then K/F is tamely ramified by Exercise 5.46. 


Section 5.7 


5.51 Let a € N such that gced(561,a) = 1. Thus, since 561 = 3-11-17, then 


a? =1 (mod3), a’°=1 (mod 11), and a'®=1 (mod 17). 


Thus, 
a°©° = (a”)*8° =1 (mod 3), a? = (a'°)°® =1 (mod 11), 
and 
@°6? = (q'®)3> =1 (mod 17) 
Hence, 


a°®' =a (mod 561), 


for all a € N. Finally, since j°°' = j(mod 561) for 7 = 3,11,17, then the result is 
secured. 


5.53 If x = y, then 


; S- a a = 1 Se 1=1. 


ack, ack, 


Ifa Ay, then 8 = a(x — y) ranges over Fy as a does. Therefore, 
1 , a(x— 1 2 1 
= ee eS Ge Ci) =0, 
qd ack, q BEF, q 

where the last equality comes from Exercise 5.52. 


Section 6.1 


6.1 Let 7 be a generator of F7 (via the hint, see Theorem A.8) and set x = 7“ and a = 7. 


Then x” = a if and only if y* = 7”, and this is equivalent to saying that 
a=bn (mod q-1). 


Via the hint, by Theorem A.24 on page 340, the latter holds if and only if a = gt for 
some t € Z. Hence, x” = a(mod p) has a solution x € Of if and only if 


Qi D/I = (y2)(a-D/s = (y9t)G-D/5 = 4-7 (mod p). 


6.3 By Proposition 6.1, 
na NFQ)-1)/3 = (5) (mod 2), (S20) 
3 


Also, by Exercise 6.2, 


(2) = 1 if and only if 6? =2 (mod n), 
TS 
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Therefore, 


6°? =2 (mod 7) if and only if (5) =1, 
3 
by the Cubic Reciprocity Law. Thus, by Congruence ($20), this is equivalent to 


a =1 (mod 2). 


6.5 If a = a + b¢3, then the associates are: a = a+ 003, Ga = —b+ (a— b)G3, Ga = 
(b— a) — a3, -a = —a — 663, —G3a = b+ (b— a)C3, and —C2?a = (a — b) + als. 


6.7 


Joex)= YD xaxto= LY x({)=-Ex(4). 
a+b=1 atb=1 aAl 
a,beFg b40 


If we set a/(1— a) =candc#1, then a=c/(1+c). Hence, 


hic aay Lue Cor 


and by Exercise 5.28, this equals —y(—1). 


6.9 By Exercise 5.30, there exists a character of order n on F,. Clearly, there are at most n 
distinct characters of order dividing n. Thus, x? for 1 < j < n are all of the characters 
of order dividing n. If a € Fi and f(x) = 0 is not solvable in F,, then by Exercise 5.32 
there exists a character x of order n with x(a) # 1. Thus, if we set S = D7", x(a), 
then trivially Sy(a) = S, so S(y(a) — 1) =0. Since x(a) £1, then S=0= Ny». If 
a € F} and f(x) = 0 is solvable in F,, then there exists a 8 € F, such that 6" = a. 
Therefore, 


x(a) = x(B") = x(8)" = 1, 


since x” = €. This implies that 


Lastly, if a = 0, then 


> 2 x(a) = €(0) =1= Npy. 


This proves the first assertion. In particular, if p > 2 =n, let yp(a) = (2). Then 


Nyy = €(a) + xp(a) = 14+ (<) 


6.11 Let g(x) = 2? —a and h(x) = 2? — b. Then 


Nyp = SZ Ng.pNn,p; 


a+b=1 
a,beF, 
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so by the last part of Exercise 6.9, this equals 


1 


220.4 ale. 


a 


a,beF, 


A fact from elementary number theory is that the first two sums are equal to zero 
(see the solution to Exercise 5.33), so we have only to evaluate the last sum. For this 
we employ Exercise 6.7. 


Joox!) = =x(-1) = = (=) = ye, 


Pp 


from which the result follows. 


6.13 If (p—1) | k, then since x?~! = 1(mod p), by Fermat’s Little Theorem, we have that 
the sum is equal to p — 1 = —1(mod p). Now assume that (p — 1) {k, and let a € FF 
be a generator—see Theorem A.8 on page 331. Then 


p-1 p-2 p-2 k(p—-1 
= Pp 
k jk ky\j _ 1- ak @-)) 
Lt = a — (a ) =a 1 _ ak ’ 
g=1 j=0 j=0 


where the last equality follows from Theorem B.4 on page 347. By Fermat’s Little 
Theorem, the numerator vanishes, but since (p — 1) { k&, then the denominator does 
not. 


6.15 This follows since P? = (10 + 3/—27), and 7° = 10? + 27- 3?. 
Section 6.2 


6.17 By Proposition 6.2 on page 278, 
2 
e #2 (=) = Ne™)-0)/2 (nod 7). 
: 4 


T 


By taking complex conjugates, we get the same congruence modulo 7. Thus, 


2 
(2) 7 = al¥eO-9P = gD? = (<) paaag. 
4 Pp 


Therefore, 


6.19 The proof is essentially the same as that given in the solution of Lemma 6.1 on 
page 263. 


6.21 By part (a) of Lemma 6.2 on page 264, 
G(x) = GOC)IOGX). 
However, by Exercise 5.54 on page 260, and the fact that »? = y~ 
G(x") = vB, 


2 — x, we have 


and by Lemma 6.5 on page 280, 
F(x%x) = (-1) Ot /4n, 
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6.23 Since p is a prime in Zi], then (2), = 1 by Exercise 6.16, so by part (b) of Proposi- 


P 
tion 6.3 on page 279, a is a quartic residue modulo p. Furthermore, if 


a=z* (mod p) 


for some z € Z, then certainly (F) = 1. Conversely, if (¢) = 1, then a = 27 (mod p) 
for some z € Z. By the part just proved, a is a quartic residue modulo p, so 


a=a* (mod p) 


for some a =a+bi € Z[i], as well. Thus, a? = +z(mod p). Hence, for some ¢,d € Z, 
we have 


a? — b? + 2abi = +z +4 (c+ di)p, 


so by equating coefficients, 


a? —b?+2+pc=0, and 2ab = pd. 


Therefore, since p is odd, then either p | a or p | b. If p | a, then 


b? = See (mod Pp), 


so 
a=a*=2z7=b* (mod p). 


Similarly, if p | b, then 


a=a‘=a‘ (mod p). 


6.25 By Exercise 6.17, (=4), = +1, so by Exercise 6.18 we must have (=), = 1, since 


(2) =1 from (A.10) on page 342. Therefore, 


| () -(2), 


and by Exercise 6.17 and Proposition 6.3, this equals 


(42*) -(€),-@,-@). 


as required. 


6.27 By part (a) of Lemma 6.2 on page 264, 


G(x)* 


T(x)? = G2? 


However, since x? is a quadratic character, then G(x?) = +,/p, and by Claim 6.1 on 
page 265 


GOd)* = x(-1)PI (xx) I0GX 7): 
Thus, 
T(x)? = x(-DI(%Gx ) I (xx); 


so dividing through by J(x,x ) then multiplying through by y(—1) we get, 
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which is part (a), by Lemma 6.5 on page 280. For part (b), we use part (a) in the 
following. 


FP X7) = IOP OP?) = PF (-DIO?, x) = x(-DIOGx) = 7, 


where the last equality comes from Lemma 6.5. Therefore, y(a) = a'?-)/4 = a™ 
(mod 7) implies 
p-1 


—J(x3,x7) = > "(1-9)" (mod 2), 


j=l 
and by the Binomial Theorem this is congruent to 
p-l m mM, m 2m p—l1 
De ag yi-n*( ‘ ee = y-ae( ‘ ) Spe (mod T). 
j= = k=0 j=l 


However, by Exercise 6.13 on page 277, Sen jem = 0(mod p) for any k < m, since 
in that case, 5m —k <p—1. Also, if k =m, then a gom-k — —1. Hence, 


Therefore, 
aoe 3 m[{2m 
2a=74+7=7 = J(x",x) = (-1) (mod 7). 
m 


By taking complex conjugates, we also get 


Thus, 


which is part (b). 
Section 6.3 


6.29 By Exercise 5.34, quadratic Gauss sums are pure. However, suppose that x has order 
k > 2, q = p and g is a primitive root modulo p such that g = 1(mod 4k”). Let 


oq € Gal(Q(C4x2p)/Q) be defined by og(Cax2p) = (Gan2p)%. Since G(x) € Q(Gkp), by 
part (a) of Proposition 6.7, then by Exercise 5.52, 


(GQx)*)” = G00)", 


so since Og|g(¢,,) € Gal(Q(Cpx)/Q(Ck)) for g = 1,2,...,p— 1, then G(x)* € Q(¢k). 
Thus, if r = G(x)/\/p is a root of unity, then rth = 1 since r2* € Q(¢,). Thus, 
r°s =r. However, 


— poy — EO)? _ XING) _ an, 
= Vp"! +\/p +X(g) . 


Therefore, y(g) = +1, contradicting that y has order k > 2. 
This solution is due to R. Evans [17]. 
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6.31 If |Ap| = ¢ > 3 a prime, then choose r € Z such that r # 1(mod £) and (4) = 1. 


Then rR = R(mod £). Thus, ¢ | R(r—1), which forces ¢ | R. But R+ N= a =0 
(mod £), so @ | N. Now letA p = dod, where dp is a discriminant, with |d9| an odd 
prime, or one of 4 or 8, and d; € Z with |d,| A +1. Let 


fi Re (Z/|dolZ)", 


be the natural map r+> 7. Then f is onto since if (4) = —1, then (Asie) =-l, 
given that (Az) = 1, so all elements of (Z/|do|Z)* are covered. Hence, 


| ker(f)| = e(1di|)/2. 


Therefore, among the ¢(|Ar|)/2 elements in R, exactly $(|d1|)/2 reduce, modulo |dol, 
to a given element in (Z/|do|Z)*. Hence, if |do| is an odd prime, then 


R= 5 old) S* a=0 (mod |do)). 


If do = —4, then R = ¢(d,)(1 + 3)/2 = O(mod |dol), and if do = +8, then R = 
o(d1)(1 +3 +5+7)/2 =0(mod |do|). Since do | R for all odd primes and for 4 or 8, 
when they occur, then |Ap| | R. Similarly, |Ap| | N. 


The above solution is due to Lemmermeyer [38]. 


Section 6.4 


6.33 By Example 5.8 on page 190, 


(1—¢,)""* = (r) 
in Dp, so 
(1—¢,)° =(1-¢) 
for all o € Gal(F/Q). Thus, 
(1— Gr)? € (1 - Gr). 
By Exercise 6.32, 6(a) = +¢/a7 =+1(mod r). Since a is primary, then 
a=z (mod (1—-¢,)?), 


for some z € Z, so 

a? = 27 = zh sat) = 2r-1)/2 = (41)" = 41 (mod (1-¢,)?), 
where the equality comes from Theorem B.4 on page 347. Therefore, 
l=+G/a7=+¢) (mod (1—¢,)”), 


sO 


¢» =+1 (mod (1 —-¢,)?). (S21) 
Therefore, since ¢, = 1—(1—¢,) and j > 1 by (S21), then by the Binomial Theorem, 
4+1=@2=1-j(1-¢,) (mod (1—¢,)?). 
If the plus sign holds, then (1 — ¢,) | 2, sor | 2 by Exercise 2.46 on page 86, a 
contradiction since r > 2. Thus, 
1—j(1-¢,) =1 (mod (1-¢,)’), 


which implies that (1 — ¢,) | j, from which we get that r | j by Exercise 2.46 again. 
This means that ¢? = 1, so we have the result. 


